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WO 98/06048 PCT/CA97/00539 
MOLECULAR MODELLING OF NEUROTROPHIN-RECEPTOR BINDING 

FIELD OF THE INVENTION 

5 The present invention relates to a computational method for identifying minimum- 

energy structures in molecular systems, and more particularly for determining the bioactive 
conformations of peptide domains. The invention relates to determination of the bioactive 
conformations of the amino and carboxyl termini of Nerve Growth Factor (NGF) and 
analogues thereof, and the 3-dimensional structures of their complexes with the 24-amino 
1 0 acid second leucine-rich motif (LRM) of TrkA which represents the binding site for NGF . The 
invention further relates to the theoretically identified geometries of the TrkB and the pZS 1 ™ 
receptor binding sites and to the mechanisms of neurotrophin binding. 



BACKGROUND OF THE INVENTION 

15 

The neurotrophins are a family of structurally and functionally related proteins, 
including Nerve Growth Factor (NGF), Brain-Derived Neurotrophic Factor (BDNF), 
Neurotrophin-3 (NT-3), Neurotrophin-4/5 (NT-4/5) and Neurotrophin-6 (NT-6). These proteins 
promote the survival and differentiation of diverse neuronal populations in both the peripheral 

20 and central nervous systems (Hefti, 1986; Hefti and Weiner, 1986; Levi-Montalcini, 1987; 

Barde, 1989; Leibrock et a/., 1989; Maisonpierre et a/., 1990; Rosenthal et a/., 1990; Hohn et 
a/., 1990; Gotz et a/., 1994; Maness et a/., 1994) and are involved in the pathogenesis of 
diverse neurological disorders. Neurotrophins exert certain of their biological effects through 
specific interactions with a class of transmembrane receptor tyrosine kinases (TrkA, TrkB and 

25 TrkC) (Kaplan et a/., 1991; Klein et a/., 1991, 1992; Soppet et a/., 1991; Squinto era/., 1991; 
Berkemeier et a/., 1991; Escandon et a/., 1993; Lamballe et aL t 1991). Specificity of 
neurotrophin action results from their selective interactions with Trk. That is, TrkA only binds 
NGF (Kaplan et ai t 1991; Klein et at.. 1991); TrkB binds BDNF, NT-3 and NT-4/5 (Soppet et 
a/., 1991; Squinto et a/., 1991; Berkemeier et ai ( 1991; Escandon etai, 1993; Lamballe et 

30 a/., 1991.; Klein et a/., 1992; Vale and Shooter, 1985; Barbacid, 1993); and TrkC exclusively 
binds NT-3 (Lamballe et a/., 1991; Vale and Shooter, 1985). This is particularly evident when 
the Trk receptors are coexpressed with the common neurotrophin receptor p75 NTR (For review 
see Meakin and Shooter, 1992; Barbacid, 1993; Chao, 1994; Bradshaw et a/., 1994; Ibanez, 
1995). 

35 The common neurotrophin receptor p75 mR is a transmembrane glycoprotein 

structurally related to the tumor necrosis factor and CD-40 receptors (Meakin and Shooter, 

1 
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1992; Ryden and Ibanez, 1996). As all neurotrophins bind to p75 NTR with similar affinity 
(Rodriguez-Tebar ef a/. t 1990; Hallbook et a/., 1991; Rodriguez-Tebar et ai , 1992; Ibanez, 
1995), neurotrophin specificity is conventionally thought to be caused by the binding 
selectivity for Trk receptors which are differentially expressed in different neuronal populations 
5 (Ibanez, 1995). However, accumulated experimental data on neurotrophin activity reveal 

important functional aspects of p75 NTR (Heldin et a/., 1989; Jing et at., 1992; Herrmann, 1993; 
Barker and Shooter, 1994; Dobrowsky et ai , 1994, Matsumoto etaL, 1995; Marchetti etaL, 
1996; Washiyama et a/., 1996). The common neurotrophin receptor enhances functions and 
increases binding specificity of Trk receptors (Barker and Shooter, 1994; Mahadeo et a/., 

10 1994; Chao and Hempstead, 1995; Ryden and Ibanez, 1996). In addition, p75 NTR possesses 
unique, Trk-independent signalling properties which involve ceramide production through 
activation of the sphingomyelin cycle (Dobrowsky et a/., 1994), apoptosis (cell death) (Van der 
Zee et aJ. t 1996; Cassacia-Bonnefil et a/., 1996; Frade et ai, 1996), and activation of the 
transcription factor NFkB (Carter et ai, 1996). Recently, p75 NTR has been demonstrated to 

15 participate in human melanoma progression (Herrmann etaL, 1993; Marchetti et a/., 1996). 
Furthermore, NGF and NT-3 increase the production of heparin by 70W melanoma cells, 
which is associated with their metastatic potential (Marchetti et a/., 1996). Although this 
effect has been shown to be mediated by the common neurotrophin receptor, neither BDNF 
nor NT-4/5 appeared to be active. Explicit modelling of the various neurotrophin receptors 

20 will be critical to the rational design of neurotrophin-based drugs. 

NGF displays high and low affinity binding sites in sensory and sympathetic neurons 
and in pheochromocytoma PC12 cells (Sutter et a/. t 1979; Landreth and Shooter, 1980; 
Schechter and Bothwell, 1981). The coexpression of the common neurotrophin p75 NTR 
receptor with TrkA is required to form the high affinity binding site (Hempstead et a/., 1991; 

25 Barker and Shooter, 1994; Mahadeo e/a/., 1994; Chao and Hempstead, 1995). Several 
models of the TrkA-p75 MTR interaction have been proposed to explain high affinity NGF 
binding (Bothwell, 1991; Chao, 1992b; Chao and Hempstead, 1995; Wolf et a/., 1995; Ross et 
a/., 1996; Ross et a/., 1997). These models differ with respect to direct (conformational 
model) or indirect (ligand-presentation model) interaction of p75 NTR with TrkA. Direct TrkA- 

30 p75 NTR interaction is consistent with much of the existing experimental data. 

NGF, a 118 amino acid protein, is an extremely important neurotrophin, being 
implicated in the pathogenesis of Alzheimer's disease, epilepsy and pain (Ben and Represa, 
1990; McKee etaL, 1991; Leven and Mendel, 1993; Woolf and Doubell, 1994; Rashid etaL, 
1995; McMahon et a/., 1995). The binding of NGF to its receptors is determined by distinct 

35 sequences within its primary amino acid structure. While several regions of NGF participate 
in the NGF/T rkA interaction, mutation studies suggest that relatively few key residues, namely 

2 



>: <WO 9B06048A2„L> 



WO 98/06048 



PCT/CA97/00539 



those located in the NGF amino and carboxyi termini, are required for high affinity binding. 

The hairpin loop at residues 29-35 is responsible for recognition by p75 NTR (Ibafiez et 
a/.. 1992; Radziejewski et al., 1992), while the amino and carboxyi termini are important 
binding determinants for recognition by the TrkA receptor (Shih et a/., 1994; Moore and 
Shooter, 1975; Suteref a/., 1992; Burton etal., 1992, 1995; Kahle eta/., 1992, Luo and Neet, 
1992; Drinkwateref a/., 1993; Treanor et at., 1995; Taylor et a/., 1991). Truncation of either 
the amino or carboxyi terminus of NGF produces less active NGF analogues; similarly most 
deletion or point mutations of the amino terminus also lead to NGF analogues with diminished 
activity (Shih et a/., 1994; Burton et at., 1992, 1995; Kahle et at., 1992; Drinkwater et al.. 
1993; Treanor et at., 1995; Taylor et at.. 1991). On the other hand, the NGFA2-8 (NGF with 
residues 2-8 removed) and NGFA3-9 deletion mutants are almost as active as wild type NGF 
(Drinkwater et al., 1993). These NGF structure-activity relationships in combination with the 
considerable species variability (mouse, human, guinea pig and snake) of the amino acid 
sequence of the NGF termini (McDonald et at., 1991) are of potential value in understanding 
the NGF/TrkA interaction. 

NGF exerts its biological activity as a non-covalent dimer (Treanor etal., 1995; Burton 
era/., 1995; McDonald etal., 1991; Ibanez etal., 1993; Bothwell and Shooter, 1977). Two 
118 residue NGF monomers are dimerized by hydrophobic and van der Waals interactions 
between their three anti-parallel pairs of 0-strands; consequently, the amino terminus of one 
NGF monomer and the carboxyi terminus of the other are spatially juxtaposed (McDonald et 
at., 1991). Furthermore, although a dimer has 2 pairs of termini, only one pair of termini is 
required for TrkA receptor recognition (Treanor et al., 1995; Burton et at., 1995). Accordingly, 
solving the conformation of the complex formed by the amino terminus of one monomer and 
the carboxyi terminus of the other in dimeric NGF is of fundamental relevance to 
understanding the interaction of NGF with TrkA. 

The X-ray crystallographic 3-dimensional structure of a dimeric mouse NGF (mNGF) 
has been reported recently (McDonald etal., 1991). However, within this structure, the amino 
terminus (residues 1-11) and the carboxyi terminus (residues 112-118) remain unresolved for 
both pairs of termini. High flexibility of the NGF termini makes it difficult to experimentally 
determine their bioactive conformations, particularly since transition metal ions commonly 
used in X-ray crystallography (McDonald et al., 1991) have high affinity for His residues 
(Gregory et al., 1993) which are present in the NGF amino terminus (Bradshaw et al., 1994). 
Indeed, conformational alterations in the receptor binding domains of NGF caused by Zn J * 
cations leading to its onactivation haw© been described recently (Ross et al., 1997). Since the 
amino and carboxyi termini are crucial for NGF bioactivity as mediated via TrkA and because 
of the significance of NGF in multiple neurologic disease processes, the determination of the 
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biologically active conformation of these termini is an important and challenging problem for 
computational chemistry. 

In contrast with NGF, little is known about the BDNF binding determinant for TrkB 
activation. Although there were several attempts to identify structural elements of BDNF that 
5 determine its receptor specificity (Ibanez et a/., 1991 t 1993, 1995; Suter et a/., 1992; Hag et 
a/., 1994; Kullander and Ebendal, 1994; Urfer et a/., 1994; Lai et ai % 1996), the results of 
mutational experiments are still controversial. Thus, the reported importance of residues 40- 
49 (variable loop region II) for TrkB related activity of BDNF (Ibanez e/a/., 1991; llag et a/., 
1994) has not been confirmed by recent studies (Lai et a/., 1996), in which it has also been 

10 shown that the key residues of the TrkB receptor binding domain of BDNF are situated 

between residues 80 and 109. The latter result is consistent with earlier observations that the 
BDNF termini are not involved in TrkB binding and that the key residues of the BDNF binding 
epitope are Arg 81 and Gin 84 (Ibanez et a/., 1993). In addition, it has been found that receptor 
binding domains of BDNF and NT-3 are located in the same regions of the molecules, i.e. in 

1 5 their "waist" parts (Ibanez et al. , 1 993; Urfer et a/. , 1 994). As far as the NT-3 receptor binding 
domain is concerned, detailed mutational studies carried out by Urfer et a/. (1994) revealed 
that it comprises a surface that includes major parts of the central (J-strand stem, with the 
most important residues being Arg 103 , Thr 22 , Glu* Arg 56 . Lys 80 and Gin 83 . These residues 
correspond to Arg 104 , Thr 21 , Glu 55 , Lys 57 t Arg 81 and Gln w of BDNF, respectively (McDonald et 

20 a/., 1991). 

The location of the neurotrophin binding sites within the Trk receptors is the subject of 
debate (MacDonald and Meakin, 1996). Published data indicate the existence of two putative 
neurotrophin binding sites of Trk proteins: the "second immunoglobulin-like domain" 
(residues Trp^-Asn 365 *, denoted lgC2) and the "second leucine-rich motif (LRM-2A and 

25 LRM-2B, residues Thr^-Leu 120 of TrkA and TrkB, respectively) (Schneider and Schweiger, 
1991; Kullander and Ebendal, 1994; Perez etai, 1995; Urfer et a/., 1995; Windisch et at., 
1995a, 1995b, 1995c; MacDonald and Meakin, 1996; Ryden and Ibanez, 1996). 

p75 mR belongs to a family of cell surface proteins that share a common pattern of four 
repeated cysteine-rich domains (CRDs) in the extracellular portion (Yan and Chao, 1991; 

30 Baldwin and Shooter, 1994). X-ray crystallographic studies on the extracellular portion of the 
related protein p55 TNFR revealed that CRDs fold independently of each other, and three 
conserved disulfide bonds maintain a specific geometry of each CRD which consists of two 
loops (Banner et a/., 1993) (loops A and B, Fig. 1a). Very little is known about the p75 NTR 
receptor functional epitope and, particularly, about residues directly participating in molecular 

35 recognition processes. All four CRD repeats of p75 NTR are found to be required for binding, 
with the second CRD being most important (Baldwin and Shooter, 1995). Residue Ser 50 of 

4 
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p75 NTR seems to be essential for NGF binding (Baldwin and Shooter, 1995). 

The results of site-directed mutagenesis suggest that the p75 NTR receptor binding 
domains within neurotrophins are juxtaposed positively charged residues located in two 
adjacent hairpin loops which represent variable region I (residues 23-35) and V (residues 93- 
98) (Ibanez e/a/., 1992; Ryden etal., 1995; Ibanez, 1994; Ibanez, 1995; Ryden and Ibanez, 
1996) (Fig. 1c). In NGF, residues Lys 32 , Lys 34 , and Lys 95 have been found to be involved in 
p75 N7R receptor binding, with Lys 32 making the strongest contact, followed by Lys 34 and Lys 95 . 
In addition, some role of Asp 30 , Glu 35 , Arg 103 , Arg 100 , Lys Bfi , and He 31 in p75 MTR binding and 
biological activity has been demonstrated (Ibanez et a/., 1992; Ibanez, 1994). Two residues 
of variable region I, namely Arg 31 and His 33 , have been demonstrated to be essential for 
binding of NT-3 to p75 NTR , whereas similarly located Arg 34 and Arg 36 mediate binding of NT- 
4/5 (Ryden et a/. t 1995). In contrast to NGF, a positively charged residue in variable region V 



__ *lf KYT-1 rsr KVT-AtE* kr\ t\~7GWTK /Dw«4am r*4 ^/ H OnC\ 1^ DPiKrr I 

nwk \*\ iiiwoi i wi k/n imii t£j wi ■ i 1 ~w w i i^i — »w iw i +j yi\jru€ti c^t ai. , I II I Ot/hr , I IUWCVCI , 

only residues of variable region V, Lys 95 , Lys 96 and Arg 97 , bind to p75 NTR and compensate for 
the lack of positively charged residues in variable loop region I (Ryden et a/., 1995). 

Therefore, it is an object of the invention to establish explicit atom-level models of the 
Trk and p75 NTR recognition sites for neurotrophins. 

It is a further object of the invention to provide atom-level models of the mechanism of 
interaction of neurotrophins and their receptors. 

SUMMARY OF THE INVENTION 



The inventors describe a method of characterizing the interaction between 
non-contiguous domains of a ligand and its receptor. The specific examples of the method 
described in detail below concern the interaction of nerve growth factor (NGF) with its TrkA 
receptor binding site, brain-derived neurotrophic factor (BDNF) with its TrkB receptor binding 
site, and the interaction of the neurotrophins (e.g., NGF, BDNF, NT-3, and NT-4/5) with the 
neurotrophin binding site of the common neurotrophin receptor p75 NTR . 

In one aspect of the invention there is provided a variable basis Monte-Carlo (VBMC) 
simulated annealing method for identifying an optimal structure in a molecular system. The 
method comprises: 

(a) providing a Markov chain with an initial basis set of N configurational variables 
which define a structure for said molecular system; 

(b) translating the N configuratoonal variables simultaneously along a basis vector 
to produce a new structure for the molecular system, wherein magnitude of translation is 
chosen randomly within a preselected range; 
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(c) calculating potential energy of the new state of the molecular system; 

(d) deciding whether to accept or reject the new structure using an effective 
temperature dependent transition function, and if the new structure is accepted, replacing the 
current structure with the accepted structure; 

(e) repeating steps (b) to (d), each repetition having a new basis vector, for a 
preselected first number of repetitions at a preselected upper temperature; then 

(f) decreasing temperature according to a preselected cooling schedule; 

(g) repeating steps (b) to (d) and (f), each repetition having a new basis vector, for 
a preselected second number of repetitions; then 

(h) storing current structure of the molecular system; 

(I) repeating steps (g) and (h) until a preselected number of stored structures 
have been accumulated; then 

(j) rotating the basis set of configurational variables according to an effective 
distribution of the accumulated structures so that basis vectors are directed along low-energy 
valleys of a potential energy hypersurface thereby accelerating conformational motions and 
structural transitions of the molecular system, and erasing accumulated structures from steps 
(h) and (I); 

(k) repeating steps (I) and Q) until, after an effective third number of repetitions, a 
preselected lower temperature limit is reached, thereby producing an unrefined global 
minimum structure; and 

(I) refining the global energy minimum structure so that at least one local 
minimum on the potential energy hypersurface is identified. 

The present invention also provides a method of identifying structures of a peptide 
domain of a neurotrophin to bind with a binding site of a receptor. The method comprises 
providing a plurality of neurotrophin analogues wherein at least one of neurotrophin analogue 
binds the receptor, identifying an optimal or near-optimal structure for respective peptide 
domains of the neurotrophin analogues that bind the receptor by applying a preselected 
number of the variable basis Monte-Carlo (VBMC) annealing simulations described above to 
each of the respective peptide domains to identify minimum or near-minimum energy 
structures for the peptide domains, and comparing said minimum or near-minimum energy 
structures to identify common structural features shared by neurotrophin analogues exhibiting 
receptor mediated activity. 

The step of providing neurotrophin analogues may include providing wild-type 
neurotrophin analogues or muHants thereof. 

The present invention also provides a method of identifying structures of a domain of 
a ligand to bind with a binding site of a receptor, comprising providing a plurality of ligand 
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analogues wherein at least one ligand analogue binds the receptor, identifying an optimal or 
near-optimal structure for respective domains of the ligand analogues that bind the receptor 
by applying a preselected number of the variable basis Monte-Carlo (VBMC) annealing 
simulations described above to each of the respective domains to identify minimum or near- 
5 minimum energy structures for the domains; and comparing said minimum or near-minimum 
energy structures to identify common structural features shared by ligands exhibiting receptor 
mediated activity. 

As would be apparent to a person skilled in the art, methods of the present invention 
are not limited to neurotrophin-receptor interactions, but are applicable to the characterization 

10 of other ligand-receptor interactions. That is, an array of ligands that are either known to 

interact with a specific receptor or are candidate ligands that may potentially interact with this 
receptor may be computationally analyzed to identify optimal or near-optimal structures that 
provide the desired interaction. In some embodiments of the invention, the ligand may have a 
peptide domain, e.g., the ligand may be a peptide hormone, such as insulin, prolactin, or 

1 5 growth hormone. In some embodiments, the ligands or candidate ligands under study may 
be native ligands, mutants of native ligands, or isosteres of native ligands. They may be 
structurally different but have similar binding activity or functional effect(s). The optimal or 
near-optimal structure identified may act as an agonist or antagonist. "Isosteric" is an 
art-recognized term and, for purposes of this disclosure, a first moiety is "isosteric" to a 
20 second when they have similar molecular topography and functionality. This term should not, 
therefore, be restricted to mere similarity in chemical bond configuration. Two compounds 
may have different chemical bond configurations but nonetheless be isosteric due their 
3-dimensional similarity of form and chemical nature, e.g. relevant centers such as 
heteroatoms are located in the same position, charges and/or dipoles are comparable, and 
25 the like. 

The present invention provides a ligand for binding with TrkA, wherein TrkA comprises 
a leucine rich motif (LRM) comprising amino acid residues 93 to 1 17, the coordinates of the 
residues of said LRM given in Appendix 1, said LRM motif having five binding areas, area A 
comprising amino acid residue Phe 105lA capable of hydrophobic bonding, area B comprising 

30 residues Phe 11UA , Phe 113lA , and Thr 114tA capable of hydrophobic bonding, area C comprising 
residues Asp 109lA and His ,12tA capable of ionic interaction, area D comprising residue Lys 100lA 
capable of ionic interaction, and area E comprising residues Asn 95 * to He 98 * capable of 
multiple parallel (J-strand type hydrogen bonding. The ligand includes at least three moieties 
each comprising effective atomic elements having spatial occupancy, relative atomic position, 

35 bond type and charge to define a three dimensional conformation of said ligand so that a first 
moiety can bind with a first binding area, a second moiety can bind with a second binding 
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area, and a third moiety can bind with a third binding area. 

Put another way, the ligand comprises at least three moieties effectively spaced 
relative to each other so that a first moiety can bind with a first binding area, a second moiety 
can bind with a second binding area, and a third moiety can bind with a third binding 
5 area. More specifically, the ligand comprises at least one moiety being capable of hydrophobic 
bonding and being present in an effective position in the ligand to hydrophobically bind to area 
A, at least one positively charged moiety being present in an effective position in said ligand 
to ionically bind to area C, and at least one negatively charged moiety being present in an 
effective position in said ligand to ionically bind to area D. 
10 Tne Present invention also provides a method of designing a ligand to bind with an 

LRM of TrkA. The method comprises providing a template of said LRM of TrkA, said LRM 
comprising amino acid residue sequence 93 to 117 inclusive, said amino acid residues having 
spatial Cartesian coordinates given in Appendix 1 wherein binding area A in the sequence 
comprises residue Phe 105tA capable of hydrophobic bonding, binding area B comprises 
1 5 residues Phe 1 "", Phe 113tA . Thr 114tA capable of hydrophobic bonding, binding area C comprises 
residues Asp 109tA and His 112tA capable of ionic interaction, binding area D comprises Lys 100lA 
capable of ionic interactions, and binding area E comprises residues Asn 95 * to He 98 * capable 
of multiple parallel 0-strand type hydrogen bonding and computationally evolving a chemical 
ligand using any effective algorithm with preselected spatial constraints so that said evolved 
20 ligand comprises at least three effective moieties of suitable identity and spatially located 

relative to each other in the ligand so that a first of the moieties hydrophobically interacts with 
binding area A, a second of the moieties ionically interacts with binding area C, and a third of 
the moieties ionically interacts with binding area D of the LRM of TrkA. The algorithm may be 
a genetic algorithm and the LRM may be the second LRM of a plurality of LRMs. 
25 Tne invention further provides a method of designing a ligand to bind with an 

LRM of TrkA, comprising providing a template of said LRM of TrkA. said LRM comprising 
amino acid residue sequence 93 to 117 inclusive, the amino acid residues having spatial 
Cartesian coordinates given in Appendix 1 wherein binding area A comprises Phe 105tA capable 
of hydrophobic bonding, binding area B comprises Phe 11m , Phe 113tA , Thr 11 "* capable of 
30 hydrophobic bonding, binding area C comprises Asp ,09tA and His 112lA capable of ionic 

interaction, binding area D comprises Lys 100lA capable of ionic interaction, and binding area E 
comprises Asn 95 * to He 98 * capable of hydrogen bonding. The method comprises 
computationally evolving a ligand using an effective algorithm so that said evolved ligand 
comprises at least three effective moieties located relative to each other in the ligand so that 
35 a first moiety can bind with a first of the five binding areas, a second moiety can bind with a 
second of the five binding areas, and a third moieiy can bind with a ihird of the binding areas. 
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In another aspect the invention provides a ligand for binding with TrkB, wherein TrkB 
comprises a leucine-rich motif (LRM) comprising amino acid residues 93 to 1 17, the 
coordinates of the residues of said LRM given in Appendix 2, said LRM having five binding 
areas, binding area F comprising amino acid residue Asp 100,B capable of ionic interaction, 
5 binding area G comprising amino acid residue Lys ,D9,B capable of ionic interaction, binding 
area H comprising amino acid residues Lys 113tB capable of ionic interaction, binding area I 
comprising amino acid residue Arg M,B capable of ionic interaction, and binding area K 
comprising amino acid residue Lys 1M,B capable of hydrogen bonding or ionic interaction. The 
ligand comprises at least three moieties effectively spaced relative to each other so that a 
1 0 first moiety can bind with a first binding area, a second moiety can bind with a second binding 
area, and a third moiety can bind with a third binding area. 

The present invention provides a method of designing a ligand to bind with TrkB, 
wherein TrkB comprises a leucine-rich motif (LRM) comprising amino acid residues 93 to 117 
the coordinates of the residues of said LRM given in Appendix 2, said LRM having five 
1 5 binding areas, binding area F comprising amino acid residue Asp 100,B capable of ionic 

interaction, binding area G comprising amino acid residue Lys 109lB capable of ionic interaction, 
binding area H comprising amino acid residue Lys 113IB capable of ionic interaction, binding 
area I comprising amino acid residue Arg 94 " 3 capable of ionic interaction, and binding area K 
comprising amino acid residue Lys 104tB capable of hydrogen bonding or ionic interaction. The 
20 method comprises computationally evolving a ligand using an effective algorithm so that the 
evolved ligand comprises at least three effective moieties located relative to each other in the 
ligand so that a first moiety can bind with a first of said five binding areas, a second moiety 
can bind with a second of said five binding areas, and a third moiety can bind with a third of 
the binding areas. The algorithm may be a genetic algorithm. 
25 Tne invention also provides a ligand for binding with the common neurotrophin 

receptor p75 NTR , wherein p75 NTR comprises a binding site including amino acid residues Cys 39 * 
to Cys 9 * inclusive, said residues of said binding site having spatial coordinates given in 
Appendix 3. said binding site having three binding areas including binding loop 2A comprising 
region Cys 390 to Cys 5 * capable of attractive electrostatic interactions, binding loop 2B 
30 comprising region Cys 5 *" to Cys 79p capable of attractive electrostatic interactions, and binding 
loop 3A comprising region Cys 79p to Cys Mp capable of attractive electrostatic interactions. The 
ligand comprises at least two moieties effectively spaced relative to each other so that a first 
moiety can bind to a first loop and a second moiety can bind to a second loop. 

The present invention also provides a method of designing a ligand for binding with 
35 the common neurotrophin receptor pTS^, wherein p75 MTR comprises a binding site including 
amino add residues Cys 3 " to Cys Mp inclusive. The residues of the binding site have spatial 
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coordinates given in Appendix 3. The binding site has three binding areas including binding 
loop 2A comprising region Cys 39 " to Cys 5 * capable of attractive electrostatic interaction 
binding loop 2B comprising region Cys 5 * to Cys 79 » capable of attractive electrostatic 
interaction, and binding loop 3A comprising region Cys 79p to Cys 94 " capable of attractive 
electrostatic interaction. The method comprises computationally evolving a ligand using an 
effective algorithm so that ihe evolved ligand comprises at least two effective moieties 
located relative to each other in the ligand so that a first moiety can bind to a first of the three 
loops and a second moiety can bind to a second loop of the three loops. 

The invention provides a method of identifying a ligand to bind with a leucine rich motif 
(LRM) of TrkA. The method comprises providing a three dimensional conformation for the 
LRM. The LRM comprises amino acid residue sequence 93 to 1 17 inclusive, the amino acid 
residues having spatial Cartesian coordinates given in Appendix 1. The LRM has five binding 
areas, area A comprising amino acid residue Phe 105 " capable of hydrophobic interaction 
area B comprising amino acid residues Phe 111 " Phe 113 *. and Thr 11 - capable of hydrophobic 
15 interaction, area C comprising amino acid residues Asp 109 " and His 112 " capable of ionic 

interaction, area D comprising amino acid residue Lys 10 °« capable of ionic interaction and 
area E comprising Asn 9 ** to lie 98 * capable of multiple parallel fj-strand type hydrogen 
bonding. The method includes providing a data base containing molecules coded for spatial 
occupancy, relative atomic position, bond type and/or charge; and screening the data base to 
select those molecules comprising moieties having a three dimensional conformation and 
effective charge that can bind with at least three of the five binding areas. 

The invention provides a method of identifying a ligand to bind with a leucine rich motif 
(LRM) of TrkB. The method comprises providing a three dimensional conformation for the 
LRM. The LRM comprises amino acid residues 93 to 117. the coordinates of the residues of 
the LRM are given in Appendix 2. the LRM having five binding areas, binding area F 
comprising amino acid residue Asp 100 * capable of ionic interaction, binding area G comprising 
amino acid residue Lys 1 <** capable of ionic interaction, binding area H comprising amino acid 
residue Lys 1,3te capable of ionic interaction, binding area I comprising amino acid residue 
Arg 9 ™ capable of ionic interaction, and binding area K comprising amino acid residue Lys 1M * 
capable of hydrogen bonding or ionic interaction. The method includes providing a data base 
containing molecules coded for spatial occupancy, relative atomic position, bond type and/or 
charge; and screening the data base to select those molecules comprising moieties having a 
three dimensional conformation and effective charge that can bind with at least three of the 
Five binding areas. 

The present invention also provides a method of identifying a ligand to bind with 
common neurotrophin receptor p75 NTR The method comprises provid.ng a three dimensional 
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conformation for the common neurotrophin receptor p75 NTR , wherein p75" TR comprises a 
binding site including amino acid residues Cys 39 * to Cys 9 " inclusive. The residues of the 
binding site have spatial coordinates given in Appendix 3, the binding site having three 
binding areas including binding loop 2A comprising region Cys 39 " to Cys 58 ' capable of 
5 attractive electrostatic interaction, binding loop 2B comprising region Cys»» to Cys 79 * capable 
of attractive electrostatic interaction, and binding loop 3A comprising region Cys 79 > to Cys 9 * 
capable of attractive electrostatic interaction. The method includes providing a data base 
containing molecules coded for spatial occupancy, relative atomic position, bond type and/or 
charge; and screening the data base to select those molecules comprising moieties that can 
1 0 bind with at least two of the three binding loops. 

in one aspect of the invention, a ligand (e.g.. small molecule) designed to interact with 
a receptor such as Trk or P 75NTR may be Afunctional, such that a first portion of the ligand 

b.nds to one receptor monomer and a second portion of the ligand binds to a s«mnH ~ 

monomer. Thus, a receptor dimer is effectively cross-linked by the interaction of ligand with 
1 5 both receptor monomers. 

According to the invention, detailed information is provided regarding those amino 
acid residues of neurotrophic and of neutrophin receptors which are involved in 
neurotrophin-receptor binding. Such information includes the bioactive conformations of the 
respective peptide domains of the neurotrophins and receptors. This in turn permits the 
design of small molecules which interact with said amino acid residues or domains Where 
such interaction block the residues or changes the conformation of the domain, the small 
molecule can inhibit neurotrophin-receptor binding, in certain cases abolishing it completely 
As would be appreciated by a person skilled in the art. small molecule design according to 
this aspect of the invention is applicable to other ligand-receptor interactions in addition to 
25 neurotrophins. according to the methods disclosed herein. 

As a person skilled in the art would appreciate, the invention not only provides 
molecules (ligands) that interact with receptors, but also molecules which, in their mimicry of 
a receptor's bioactive conformation, can bind to the receptor binding site of a ligand such as 
for example, a neurotrophin. Such molecules, whether they interact with ligand or with 
30 receptor, can block native ligand-receptor interaction, .n some embodiments of the invention 
a small molecule which binds to a receptor may even trigger signal transduction similar to that 
of the native ligand, i.e.. act as an agonist. 
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The invention will be described, by example only, reference being had to the 
accompanying drawings, in which: 

the f '''TP I 8 " 0 " ^ amin ° SSqUenCeS ° f ^ COnfo - at —»y -riab.e termini of 
he dimenc NGF analogues under study. The residues are numbered from 1 to 1 18 fbr the 
ft* monomer, and from r to 118' for the second monomer. The fixed parts of the proteins 
s.mu.ated by the X-ray geometry of mouse NGF (McDonald er a/.. 1991) are underlined 

(1 ll^hNGTn 2 S T the x trUCtUre ^ COmP,6X ° f ^ amjn ° ^ ^ * 

<1-118)hNGF homodimer. The carboxy. terminus is shown in grey, the amino terminus in 

white, and the interacting conservative region His" -Asn" of the "south" part of the NGF 

dimer, in black. 

Fi9Ure 3 Sh ° WS ,he Stmcture of ,he c ^P«ex of the amino and carboxv. term.ni „, 

(i-lT8)mNGF homodimer. ' . .. -. 

Figure 4 shows the structure of the complex of the amino and carboxy. termini of 
dimenc mNGFA3-9 molecule. 

Figure 5 shows the structure of the complex of the amino and carboxyl termini of 
dimeric hNGF-H4D molecule. 

Figure 6 shows the structure of the com P .ex of the amino and carboxy. termini of 
dimeric mNGFA1-8 molecule. 

phmp K R9Ure 7 Sh ° WS StmCtUre ° f C ° mP,eX ° f ,he amin ° and termini of 

BDNF homodimer. 

Figure 8 il.ustrates the TrkA leudne-rich motifs. Locations of parallel P-strands are 
•Hustrated by the shaded areas designated by symbo. p. Fragments B1-B4 demonstrate NGF 
binding affinity, while N1-N4 do not (Windisch et at., 1995a, 1995b). 

Figure 9 shows sequences of the amino and carboxyl termini of fully active NGF 
analogues. Regions with rigid structures are underlined. 

Figure 10 shows the predicted bioactive conformation of the NGF binding domain (a) 
and the structures of the NGF/LRM-2A complexes of hNGF (b). mNGF (c), mNGFA3-9 (d) 
and hNGF:NT-4/5 (e). On.y residues essentia, for discussion are designated. A B C D and 
E represent specific NGF/LRM-2A binding areas. The LRM-2A is i.lustrated in grey. NGF in 
white. 

Figure 1 1 shows the TrkB receptor binding site of BDNF (a), predicted features of the 
BDNF/LRM-2B complex (b). and geometry of the "improper NGF/LRM-2B complex (c) OnJy 
residues essentia, for discussion are designated. F. G. H. I and K represent specific 
BDNF/LRM-2B binding areas. The LRM-2B is iiiustrated in grey. BDNF and NGF in white 
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Figure 12 is a detail of the LRM-2A conformation when bound to hNGF. The three 
groups of the B-strand involved in H-bonding with the parallel B-strand motif of the NGF 
analogues are designated by NH or CO. Hydrophobic residues are illustrated as light, polar 
uncharged as dark, charged as black. 
5 Figure 13 presents principal residues of the NGF binding domain essential for TrkA 

receptor recognition. Conserved residues of NGF molecules among all characterized species 
(McDonald ef a/., 1991) are designated by X, whereas those which are substituted in NGF 
molecules derived from different species by similar amino acids and NGF residues 
maintaining the bioactive conformation or interacting with specific binding areas A-E of the 
1 0 LRM-2A are indicated by X. 

Figure 14 presents principal residues of the LRM-2A. The residues which are specific 
for TrkA (Windisch et a/., 1995a, 1995b) and critical for the present model are designated by 
X and X, respectively. 

Figure 15 is a schematic representation of the second leucine-rich motif of TrkA 
(residues 93tA to 116tA) (a) and TrkB (residues 93tB to 116tB) (b) as specific binding sites for 
NGF and BDNF, respectively. General consensus residues are shaded. Arrows designate 
distinct positions essential for molecular recognition processes. 

Figure 16 represents the X-ray crystallography-derived geometry of the a-carbon trace 
of the two loops (2A and 2B) of the second cysteine-rich domain (white bonds) and the first 
20 loop (3A) of the third cysteine-rich domain (grey bonds) of p55 TNFR (Banner et al. , 1 993) (a). 

the preferred geometry of the a-carbon trace of the corresponding fragment of P 75 NTR (b), and 
the predicted bioactive conformation of mouse NGF (McDonald etal., 1991); (c). White and 
grey bonds in panel (c) illustrate different protomers. The p75 NTR binding determinant of NGF 
(loops I and V) is denoted. Disulfide bonds are shown in black. 
25 Fi 9 ure 1 ? shows the sequence of a putative binding site for neurotrophins within 

p75 NTR aligned with the corresponding sequence of homologous p55 TNFR . Conserved pattern 
of disulfide bonds is illustrated. Abbreviation (h) stands for human (Johnson et a/.. 1986), (r) 
for rat (Radeke etal.. 1987), and (c) for chicken (Large et al., 1989) p75 NTR 

Figure 1 8 illustrates the most stable geometries of the complexes of the neurotrophins 
30 with the common neurotrophin receptor p75 NTR : with NGF (a); with BDNF (b); with NT-3 (c); 
with NT-4/5 (d). The a-carbon traces of the neurotrophins are shown in white, while that of 
the p75 NTR binding site is shown in grey. Disulfide bonds are illustrated in black. Only 
residues mentioned in the text are denoted. 

Figure 19 demonstrates geometric compatibility of the van der Waals surfaces of 
35 BDNF (light) and the p75 WTR binding site (dark). 

Figure 20 illustrates conformational changes of the p75 NTR binding site upon BDNF 
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binding. Conformation of the p7S NTK receptor binding site before binding is illustrated in white, 
after binding in grey. The two conformations have been superimposed using the equilibrium 
positions of the reference a-carbon atoms located in loops 2A and 3A. Straight lines denote 
linear approximations of the conformations of the a-carbon traces of loop 2B in both 
structures. 

DETAILED DESCRIPTION OF THE INVENTION 
PART A 

THEORETICAL STUDIES OF THE BIOACTIVE CONFORMATION OF NERVE GROWTH 
FACTOR USING VBMC - A NOVEL VARIABLE BASIS MONTE-CARLO SIMULATED 
ANNEALING ALGORITHM FOR PEPTIDES 

BACKGROUND TO THE VBMC ALGORITHM 

The structural optimization of the peptide domains under study required a rigorous 
search of conformational space. As with other theoretical peptide studies, such calculations 
are plagued by a multiple-minima problem (Gibson and Scheraga. 1988) which traditionally is 
addressed using either molecular dynamics, Monte Carlo approaches or other algorithms. 
The Monte Carlo method, although exceedingly powerful, is limited by the excessive CPU 
time requirements on existing computers. To remedy this problem, we have devised a novel 
Variable Basis Monte Carlo simulated annealing algorithm for peptides (VBMC). 

The potential energy hypersurface of a peptide has an enormous number of local 
minima (Gibson and Scheraga, 1988; Piela et a/., 1989. 1994), but usually only the global 
energy minimum structure is required. Despite efforts to elaborate an efficient method of 
25 global energy minimization (Gibson and Scheraga, 1988; Judson et at., 1994), the 

multiple-minima problem is far from being solved. Although simulated annealing (Kirkpatrick 
etal., 1983; Vanderbilt et a/., 1984) is mathematically correct and guarantees convergence to 
the global minimum in principle, the suggested "cooling" schedule appears to be far too slow 
for practical purposes, and faster "cooling" has to be used (Gidas, 1985; Kushner, 1987). 
30 Within the simulated annealing approach, thermal molecular motion can be simulated either 
by Monte Carlo (Kirkpatrick et a/., 1983; Nayeem et a/., 1991; Wilson et a/., 1988; Snow, 
1992; Shamovsky etal., 1992) or molecular dynamics (Lelj etal., 1991; Villani and Tamburro, 
1994) methods. 

The major problem of the Monte Carlo technique is that motion of a molecular system 
35 along a Markov chain is very slow (Piela ef a/., 1994; Scheraga, 1989; Nayeem ef a/., 1991). 
On the other hand, the Monte Carlo technique has several considerable advantages over 
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molecular dynamics. First, it does not require calculating forces. Second, potential energy 
functions do not have to be continuous. Third, much larger steps in conformational space 
can be utilized (Li and Scheraga, 1988; Kincaid and Scheraga, 1982). Fourth, the additional 
iterative self-consistent SHAKE routine (Ryckaert et a/., 1977), which is used in molecular 
5 dynamics simulations to deal with holonomic constraints (Brooks III and Case, 1993), is not 
required in the Monte Carlo method. This has led to efforts to accelerate configurational 
motion within the Monte Carlo approach (Raj et a/., 1994; Snow, 1992; Kincaid and Scheraga, 
1982; Rao et a/., 1979; Rossky et a/., 1978). The main reason for inefficiency with the 
conventional Monte Carlo method (Metropolis et a/., 1953) is a slow and cumbersome 

1 0 sequential change of configurational variables. 

To overcome these problems, lessons can be learned from molecular dynamics. In 
molecular dynamics (Allen and Tildesley, 1987) configurational variables move 
simultaneously, and the soft modes of the system explicitly direct these motions. Soft modes 
are defined as low-energy valleys on the potential energy hypersurface. Although there have 

1 5 been attempts to direct Monte Carlo configurational motion along the soft modes (Raj et a/., 
1994; Rao et a/., 1979; Rossky e/a/. ( 1978), these improvements require calculating forces 
and, consequently, result in losing the main advantages of the Monte Carlo approach. An 
algorithm which would retain the advantages of the Monte Carlo method and direct 
configurational changes along the soft modes is thus required. One possible solution has 

20 recently been suggested (Szentpaly et a/., 1994). 

The classical Monte Carlo-Metropolis method (Metropolis etai, 1953) consists of 
modelling a Markov chain containing subsequent conformations of the molecule under study: 
x 0 , x 1( x 2 ,..., x„..., where x (x, sfo 1 ...x t m ...x, N }) is the vector of independent structural 
parameters. Each vector x,^ is derived from the preceding one, x, , by means of a random 

25 change of one randomly chosen variable: 

x^r = * m +x m i (1) 

where £ is a random quantity from the interval (-1,1) and A m is a maximal allowed increment of 
30 the given variable. N moves of the Markov chain (where N is the number of independent 
parameters) represent a Monte Carlo step. Whether the new state x, +1 is accepted or 
rejected is decided according to a transition probability function. The following functional form 
of transition probability is used in the present expression (2) (Gunningham and Meijer, 1976): 

35 W(x, - x,,) = 1/[1 * exp(&E, /kT)] (2) 
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where is the difference between conformational energies of the states x m1 and x„ k is the 
Boltzmann constant and T is absolute temperature. 

Those skilled in the art will appreciate that other functional forms of transition 
functions may be used to screen state a new state x K1 . For example, another effective 
5 temperature dependent transition function for screening a new state x*, derived from a 
previous state x, is given by: 

W(x, -x,.t) =1, if E(x 1<1 ) * E(W(x.) 

= exp(-AE, /kT), if E(x^) 2 E(W(x.) 

where AE, is the difference between conformational energies of the states x k1 and x„ k is the 
10 Boltzmann constant and T is absolute temperature. 

The direct application of this conventional Monte Carlo technique to the 
conformational space study of protein structure is not efficient because the conformational 
variables of peptide chains appear to be mutually dependent, i.e., low-energy conformational 
motions involve several conformational variables. When applied to molecular systems such 
1 5 as protein secondary, tertiary or quaternary structure, equation (1). in which structural 

parameters are sequentially changed one after another, results in very small values of £ and. 
consequently, in a very slow conformational motion (Piela era/., 1994; Scheraga, 1989; 
Nayeem et a/., 1991). In order to model a simultaneous change of the structural parameters, 
we have used Eq.(3) (instead of the conventional (1)). 



20 



= x,+X m -^g m (3) 



where g m is a randomly or sequentially chosen basis vector from a basis G^g" 1 }, Z, is a 
random quantity from an interval (-1,1) and X m is a maximal acceptable move in direction g m . 
25 In this study K m was constant and equal to 10°. 

A basis G is formed from linear combinations of independent conformational 
parameters x for their change to be realized along the soft modes of the molecule. These 
modes are revealed by dispersion analysis of the distribution of the sequential configurational 
points of the Markov chain. The soft modes of the molecular system are to be directed along 
30 the axes of the dispersion ellipsoid because they should correspond to the directions of 

maximal deviations made by the independent variables during a certain period of simulation. 
Initially, basis G coincides with a unit matrix, i.e., G = I, and Eq.(3) is equivalent to (1). The 
rotation of basis © is carried out after every 150 Monte Carlo steps based on the particular 
scattering of the previous 150 configurations st. Basis vectors g m are made equal to the 
35 eigenvectors of the covariant matrix D which has the following elements: 
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d„ = (1/M) • I (x n '- (1/M) •ZO • (x„< - (1/M) • Zx „0 (4) 

where M is the number of points in the distribution (M=150), and x n ' is the /-th structural 
parameter at the /Mh Monte Carlo step. Vectors g m form the orthononmalized basis and are 
used in the Markov chain instead of independent conformational parameters. Periodic 
rotations of basis G in Monte Carlo simulations result in a simultaneous change of all the 
conformational variables along the valleys of a potential energy surface, which considerably 
accelerates conformational motions and transitions (Szentpaly et a/., 1994). 

The method disclosed herein is highly advantageous over known processes for 
several reasons. In the present process the preselected range from which the magnitude of 
translation is chosen is constant and does not depend on the basis set. Thus the present 
method is much more efficient than previous methods in part because the magnitude of 
translation along the basis vector is limited by a preselected range. The present method is 
also more efficient because rotation of the basis set of configurational variables is avoided at 
the upper temperatures and is implemented only at the lower temperatures. 
IMPLEMENTATION OF THE VBMC ALGORITHM 

VBMC was written in Fortran-77. The algorithm was then interfaced with the 
CHARMM force field (Brooks et a/., 1983; The force field parameters are extracted from 
CHARMM release 23.1, Molecular Simulations, Inc., Waltham, MA). Torsional distortion, van 
der Waals and electrostatic interactions, and hydrogen bonding were thus explicitly 
considered. The united atom approach was used for all carbons with non-polar hydrogens. 
Point atomic charges of Weiner et al. (Weiner et a!., 1984) were used. A dielectric constant 
equal to the interatomic separation in angstroms was utilized to implicitly simulate solvation 
effects (Weiner et a/., 1984). AH calculations were performed on an 8-node IBM Scalable 
POWERparallel 2 [SP2] high performance computer at Queen's University, Kingston, 
Canada. Each Monte Carlo simulated annealing computation required up to 120 hours of 
CPU time. 

Appendix 4 is attached. Appendix 4 is a computer program written in Fortran-77 and 
is a non-limiting example of a program used to perform the VBMC simulated annealing 
method in accordance with the present invention. 

Each run of the VBMC algorithm provides a unique final structure which would 
represent the global energy minimum structure, regardless of the initial geometry, if the 
"cooling" was performed infinitely slowly. However, since this is impossible, several Monte 
Carlo simulated annealing computations must be performed from different starling points 
using different initial temperatures T 0 , "cooling" schedules and random sequences. An initial 
temperature of 1000° K is sufficient. The Markov chain is kept at a constant temperature T 0 
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for 10 4 Monte Carlo steps before the initiation of "cooling". The temperature T is then 
decreased at every Monte Carlo move (3) by a small increment such that after 2- 10 4 Monte 
Carlo steps it reaches 120° K producing an unrefined global energy minimum structure. 

The global energy minimum structure is then refined to identify the nearest local 
5 minimum or stationary point on the potential energy hypersurface. Refinement may be 

accomplished using any one of several non-linear minimization routines to identify the local 
minimum on the potential energy hypersurface. For example, gradient descent minimization 
and conjugate gradient minimization may be used. Quasi-Newton minimization methods such 
as the Newton-Raphson method may also be used. Other non-linear methods include the 

10 Marquardt method, the Powell method and a variable metric method. Those skilled in the art 
will be aware of the various non-linear routines which may be used for refinement and those 
mentioned herein are more fully described in Nonlinear Programming: Theory, Algorithms, 
and Applications, 1983. New York: John Wiley & Sons. 
APPLYING VBMC TO NGF 

15 The aim of this computational study was to ascertain the biologically active 

conformation of the TrkA receptor binding determinant of NGF. A comprehensive review of 
the literature identified an initial study set of 17 proteins which have been assessed for TrkA 
mediated activity using comparable experimental methods. From these 17 analogues, a 
subset of six (three active and three inactive) which reflects the range of structural diversity 

20 was selected for preliminary study. Accordingly, in the present invention the low-energy 
conformations of the amino and carboxyl termini of six NGF analogues were obtained by 
means of Monte Carlo simulated annealing calculations. Three of these analogues retain full 
TrkA mediated activity: human NGF (hNGF; Shih et a/., 1994; Burton et a/., 1992) t mouse 
NGF (mNGF; Burton et a/., 1992; Drinkwater ef a/., 1993), and mNGFA3-9 (deletion 3-9 of 

25 mNGF; Drinkwater et a/., 1993). The other three analogues are biologically inactive: 

hNGF-H4D (a point mutation of hNGF in which His [H] in position 4 is replaced by Asp [D]; 
Shih et a/., 1994), mNGFA1-8 (Burton e/a/. t 1992; Drinkwater et a/., 1993; Taylor et a/., 1991) 
and BDNF (Suter et a/., 1992). The results and conclusions obtained from these six 
analogues were then validated by being applied to the remaining 11 analogues of the initial 

30 study set. 

The amino acid sequences of the termini of the neurotrophin molecules under study 
are presented in Figure 1. Residues are numbered from 1 to 118 for the first monomer, and 
from 1* to 1 18' for the second monomer. The mutual orientation of the termini of the different 

monomers is determined by ah© disulfide bonds Cys 15 - Cys 80 and Cys 68 - Cys 11Q and 
35 hydrophobic interactions Val 14 :Val 14 ' and Val 109 :VaI 10Sr ( which are absolutely conserved in all 
neurotrophic (McDonald et a/., 1991). The conformational space of the peptide molecules 
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consists of flexible torsional angles determining geometry of the amino terminus of one 
monomer and the carboxyf terminus of the other, while keeping the rest of the dimer fixed in 
the known geometry of mNGF (McDonald eta,., 1991). Cova.ent bond lengths and bond 
angles of the termini are kept fixed. The geometry of each residue is determined only by 

flexible torsional angles of the peptide backbone (cD and V ) and the side chain ( X * , All 
torsional angles of amino acid Pro except for g, are fixed. Torsional angles describing 
structural motifs with the essentia,* p.anar geometry (such as peptide bonds or conjugated 
reg,ons in side chains) are kept fixed. Likewise, the configurations of the sp' -carbons are 
fixed. With these constraints, the total conformational space of the flexible termini of hNGF 
mNGF and BDNF consists of 74. 73 and 67 variables, respectively. 
RESULTS AND DISCUSSION 

By diagonalizing the covariant matrix in the framework of the VBMC algorithm rigid 
modes of a molecular system, which shou.d s.ow down conformational changes are ' 
supposed to be revealed simultaneously with soft modes of the system. To test the efficiency 
of the VBMC algorithm, its performance was compared with the conventional Monte-Carlo 
method at the equilibrium parts of the Markov chains generated at different constant 
temperatures. These comparisons showed that at high temperatures (700°-1000 0 K) the basis 
vanability results in decreasing acceptance rates. On the other hand, the temperature 
decrease reinforces the advantage of the VBMC approach, and at temperatures be»ow 600° 
acceptance rates of the VBMC Markov chains were significantly higher. Thus at T=300° K 
the acceptance rate of the conventional Markov chain was found to be 0.22 which is very 
close to that recommended by Kincaid and Scheraga (1982) as optimal (0 20) The 
acceptance rate of the VBMC algorithm obtained at the same conditions was 0 26 i e 18% 
h,gher. These evaluations are in agreement with what one would expect. Indeed at low 
temperatures only soft modes of molecules are involved in thermal motions, hence they are 
especially populated in corresponding Monte-Carlo generated conformationa. ensembles and 
can be revealed from them. Higher acceptance rates of the .ow temperature VBMC Markov 
chams over conventional Markov chains mean that the average increase of the sizes of the 
Monte-Cario moves a.ong soft modes of the system are higher than their decrease along the 
ng,d modes. Thus, the VBMC disclosed herein for evaluating peptide conformations not only 
changes directions of the Monte-Carlo moves, but a.so accelerates conformationa. changes 
by erecting them a.ong the soft modes of the molecule. Consequently, the present VBMC 
algonthm ,s more efficient than conventional Monte-Cario method only at moderate and low 
temperatures, when thermal motions tend to take place a.ong the valleys of the potenSial 
energy surface. This particular temperature region causes serious problems in known 
s,mu.ated annealing algorithms, and the VBMC method disclosed here,n provides a solution 
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to these problems. 

Seven independent VBMC simulations for each of the six molecular systems under 
study were performed. Simultaneously, local energy refinements were periodically carried out 
from a number of points generated during the "slow cooling" stage. AM local minimizations 
5 resulted in higher energy stationary points on the potential energy surface than those 

produced by VBMC computations. This means that the VBMC technique as an approach to 
global minimization cannot be replaced by relatively inexpensive periodic energy refinements 
from the high-temperature conformational space; at high temperatures only high-energy 
conformers are sufficiently populated. Although VBMC runs did not converge to the same 
10 conformation, the final structures differed insignificantly; i.e., the peptide backbone 

conformation remained unchanged but there were limited differences in the side chain 
torsional angles. Multiple VBMC simulations identified rigid and flexible parts of the 
molecules. Minimum energy structures of all molecules are displayed in Figures 2-7 while 
Table 1 presents optimized conformational parameters of the amino and carboxyl termini of 
15 the considered NGF analogues. 

GEOMETRIC SIMILARITY OF BIOACTIVE NGF CONFORMERS 

Rigid regions of hNGF and mNGF which maintain the same conformation in all 
independent VBMC simulations are formed by residues 9-1 1 and 1 12'-1 18', creating a richly 
hydrogen bonded (H-bonded) 3-dimensional complex. Conformational variability within this 
20 rigid zone is primarily restricted to side chains and does not affect the geometry of the 

backbone. An interesting example of such side chain mobility occurs with Arg 114 ', which is 
H-bonded to the carbonyl oxygen of the ninth amino acid of the amino terminus and which is 
therefore expected to have a fixed conformation. Nevertheless, Arg 114 has two distinct 
conformers in which the H-bonding is realized by the hydrogen atoms attached to the different 
25 NH 2 groups, with the energy difference being about 1 kcal/mol. In addition to local 

conformational variability, hNGF and mNGF have a region of "cooperative" flexibility which is 
made up of residues 1-8. Several distinct conformations exist within this region, with its 
conformational variability being caused by a number of possibilities for the H-bond network 
formed by residues 1-3. 

30 Comparison of the most stable conformers of hNGF (Figure 2) and mNGF (Figure 3) 

reveals striking conformational similarity between their rigid regions in spite of several 
differences in the primary structure of the amino and carboxyl termini (Figure 1). As shown in 
Table 1, both termini have the same conformation, except for the variable region consisting of 
residues 1-3. The most stable H-bond networks in this area differ considerably because of 

35 the mT3hS substitution. 

The essentia! difference between the amino acid sequences of the termini of hNGF 
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and mNGF is the mM9hR substitution, which does not change the geometry of the rigid 
region. Residues Met and Arg have different side chain types, i.e., the former is non-polar 
while the tatter is polar and positively charged under physiological conditions. It is surprising 
that this substitution does not affect geometry since it introduces the positive charge of Arg 9 
5 which is located in an electrostatic field created by at least five neighbouring charged 

functional groups. Conformational stability of the rigid region of hNGF is maintained by the 
exact fit of the Arg 9 side chain into a "hollow" in the rigid region, which is occupied by the Met 9 
side chain in mNGF. Furthermore, the Arg 9 side chain, being in this "hollow", forms two 
additional interchain H-bonds with carbonyl oxygens of residues Val 117 ' and Arg 118 ". This 
1 0 further stabilizes the rigid region of hNGF. 

The physical reason for the geometric similarity of the rigid regions of hNGF and 
mNGF is that the structural features of these two complexes are determined by similar 
H-bonding and electrostatic interactions. The major stabilizing factor is the H-bonding and 
electrostatic interaction of charged side chains of residues Glu 11 and Arg 118 . In addition, 
15 these side chains are involved in other H-bonding: the carboxyl oxygens of Giu 11 are bonded 
to the 6-NH group of His 8 , the hydroxyl group of Ser 113r and the backbone NH group of Lys 115 '; 
the side chain NH and NH 2 groups of Arg 118 * are bonded to the backbone Phe 7 and Lys 115 ' 
carbonyl groups. There are some other common interchain H-bqnds in hNGF and mNGF. 
The e-amino group of the Lys 115 ' side chain forms a H-bond with the 6-carbonyl oxygen of the 
20 Asn 77 ' residue, situated at the "south" part of NGF structure (McDonald et a/. ( 1991). 

In addition to H-bonding, the particular geometry of the rigid region in both molecules 
is also maintained by electrostatic interactions of positively charged residues, specifically 
His 4 , His 8 , Arg 114 ', Arg 118 " and His 75 '. An electrostatic repulsion between His 4 and boundary 
residues His 8 and Arg 118 " of the rigid region forces His 4 to be located as far as possible from 
25 the rigid region, separating the 1-8 loop from the rest of the structure. This tendency is 
further reinforced in hNGF by the presence of a third positively charged boundary residue 
Arg 9 . The His 75 amino acid located in the "south" part of NGF also participates in maintaining 
the structure of the rigid region. An electrostatic repulsion between His 7 * and Arg 114 prevents 
H-bonding of the amino group of Arg 114 with the Asn 77 side chain and the Ser 113 backbone: 
30 indeed, removing the e-proton from His 75 results in two H-bond contacts which change the 
global energy minimum structure of the complex. 

The separation of flexible loop 1-8 from the rigid region is also determined by residue 
Pro 5 . This amino acid imposes restrictions on conformational motion of the loop, making it 
"bent" at this particular point, ©nd thereby defining the location of the adjacent His 4 residue at 
35 the most distant point from the riged region. Thus, residues His 4 and Pro 5 are primarily 
responsible for the separation of the flexible loop from the rigid region. 
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Although mNGFA3-9 does not have the flexible loop, the energy optimized geometric 
features of its termini are very similar to those of the hNGF and mNGF rigid regions (Table 1, 
Figures 2-4). There are only two structural elements which are specific for the rigid region of 
mNGFA3-9. First, although Arg 116 ' has a different conformation, the result is functionally the 
5 same, namely both NH 2 groups of the Arg 118 side chain are H-bonded to the Glu 11 side chain. 
In addition, the e-NH group of Arg 118 ", which is H-bonded to the Phe 7 carbonyl group of the 
flexible loop in both hNGF and mNGF, faces the carbonyl terminus in mNGFA3-9. Second, 
the "hollow", which is occupied by the ninth amino acid side chain in hNGF and mNGF. is now 
filled with the protein backbone of the remainder of the loop, i.e., residues 1-2. Functionally, 
10 the structure of the rigid region of mNGFA3-9 resembles that of hNGF because of the 
stabilizing interchain H-bond contacts of the moieties located in the "hollow". 
GEOMETRIC DISTINCTION OF NGF ANALOGUES INACTIVE FOR TrkA BINDING 

The energy minimum structures of the complexes formed by the amino and carboxyl 
termini of molecules, hNGF-H4D, mNGFA1-8 and BDNF, which are inactive with respect to 
15 TrkA binding, do not possess the common structural features found in active molecules. 

Point mutation H4D in hNGF incorporates negative instead of positive charge into the 1-8 
loop. As a result of this mutation, the loop is no longer separated from the rigid region, 
forming a united complex instead (Figure 5). In this complex the negatively charged carboxyl 
group of Asp 4 is surrounded by four positively charged NH 2 groups of Arg 9 and Arg 114 , and 
20 residue His 8 is not H-bonded to the Glu 11 carboxyl group. Thus, because of the H4D point 
mutation, the common structural features inherent in hNGF and mNGF are lost from hNGF- 
H4D. 

Deletion 1-8 also changes dramatically the most stable structure of the rigid region of 
mNGF (Figure 6). The newly created positively charged amino terminus is located between 

25 negatively charged carboxyl groups of Glu 11 and Arg 118 ", with charged side chains of Arg 114 , 

Lys 115 and Arg 118 * being substantially shifted from their preferred positions in active molecules. 
Thus, the electrostatic field caused by the positive charge of the new amino terminus in 
mNGFA1-8, located in the vicinity of the carboxyl group of Glu 11 , is responsible for the 
destruction of the geometry of the rigid region inherent in the active molecules. 

30 The complex formed by the amino and carboxyl termini of BDNF is also significantly 

different from those formed by the molecules active for TrkA binding (Figure 7, Table 1). 
There are several reasons for the considerable conformational changes in BDNF. First, 
negatively charged residue Asp 5 eliminates the separation of the flexible loop from the rigid 
region, and interacts with the positively charged His 3 , Arg s and amino terminus. Second, 

35 because of its long tail, Arg 8 is unable to form the H-bond contact with the Glu 11 side chain 
and forms bifurcated H-bonds with the carbonyl group of Gly 10 . Third, Arg 116 * is H-bonded to 
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Glu 66 located in the "south" part of the molecule, which imposes a specific restriction on 
conformational possibilities of the carboxyl terminus of BDNF. 

EXPLANATION FOR THE OBSERVED STRUCTURE-ACTIVITY RELATIONSHIPS 

The principal result of the molecular simulations presented is that the NGF molecules 
5 with full TrkA mediated activity considered in this study have regions with similar 

3-dimensional structure, whereas in all inactive analogues, the geometry of this region is 
altered, mainly by novel electrostatic interactions. It is therefore reasonable to attribute the 
TrkA mediated activity to a specific conformation of this particular region. The amino and 
carboxyl terminal regions therefore represent a "functional epitope" which contains relatively 
1 0 few contact residues mediating high affinity binding. The ability of a small number of residues 
to cause high affinity binding has previously been demonstrated in other systems and may 
represent a general property of protein-protein interfaces. Some experimental data consistent 
with this concept are presented below. 

The VBMC computations show that the following amino acids are critical in 
15 determining the structure of the complex in wild type NGF dimers: His 4 , Pro 5 , His 6 , Arg 9 , Glu 11 , 
His 75 ', Asn 77 *, Lys 115 ' and Arg 118 '. This is significant since residues His 4 , Pro 5 , Glu 11 , His 75 , 
Asn 77 and Lys 115 ' are conserved in NGF molecules across all species (McDonald et a/., 1991). 
The overlap between these two sets reflects a tendency to maintain the particular geometry 
of the complex in natural NGF molecules. In addition, as demonstrated, the substitution of 
20 the important residue Arg 9 by the essentially different Met 9 does not result in geometric 

changes of the complex, which is consistent with equivalent bioactivity of hNGF and mNGF 
(Burton et a/., 1992). Thus, because of multiple physical factors maintaining the geometry of 
the complex, some species variability observed in NGF termini (McDonald et a/., 1991) is 
acceptable and does not cause geometric changes in the rigid region. 
25 It is shown that each NGF terminus has several important residues, which determine 

the particular structure of the complex. This is consistent with high sensitivity of biological 
activity of NGF to the primary structure of both the amino and carboxyl termini (Klein et a/., 
1991; Shihef a/. t 1994; Burton e/a/., 1992, 1995; Kahle etal., 1992; Drinkwater et a/. , 1993; 
Treanor ef a/., 1995; Taylor et at., 1991). The lack of the TrkA mediated activity of the 
30 mNGFA3-11, mNGFA3-12, mNGFA3-13, mNGFA3-14, rnNGFA9-14 and mNGFA112-118 
deletion mutants (Drinkwater et a/., 1993) is thus explained by the absence of the key 
residues Glu 11 and Arg 118 *. 

The separation of the flexible loop from the rest of the complex is crucial in permitting 
the rigid region to exist \mih definite geometric features. Residues His 4 and Pro 5 which cause 
35 this separation are absolutely conserved elements of the wild type NGF molecules derived 

from different species (McDonald et a/., 1991), and their compulsory location in the middle of 
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the flexible loop is well suited for facilitating the maximal separation. Point mutation PSA or 
H4D weakens or destroys the tendency of the loop to separate from the rigid region; 
accordingly, these mutations result in dramatic loss of TrkA related activity (Shih et a/., 1994) 
The flexible loop is not an essential structural element for TrkA activation since the 
5 mNGFA3-9 deletion mutant, which does not contain the loop (Figure 4), is as active as wild 
type mNGF (Drinkwater et a/., 1993). However, complete elimination of all loop residues 
results in almost inactive mutants mNGFA1-8 (Burton et a/., 1992; Drinkwater et aL, 1993; 
Taylor et a/., 1991) and hNGFA1-9 (Shih et a/., 1994; Burton etal. y 1992 f 1995; Kahle e/aA, 
1992; Luo and Neet, 1992). Our molecular simulation studies enable the explanation for this 

1 0 apparent contradiction. It is demonstrated that the backbone of residues Ser 6 -Ser 3 in the 
mNGFA3-9 mutant is naturally incorporated into the structure of the rigid region because it 
matches the geometry of the "hollow" occupied by either Met 9 or Arg 9 side chain in wild type 
NGF dimers. On the contrary, total truncation of the flexible loop in the mNGFA1-8 deletion 
mutant results in placing the positively charged amino end in the vicinity of the key H-bonded 

1 5 Glu 11 and Arg 11ff charged side chains (Figure 3), which abolishes the desired geometry of the 
rigid region (Figure 6). The same applies to the hNGFA1-9 deletion mutant which has neither 
the 1-8 loop nor the stabilizing Arg 9 residue (Figure 2). This being the case, biological activity 
of amino terminus deletions of NGF should be determined by the length of the remainder of 
the terminus rather than its particular sequence, indeed, available experimental data confirm 

20 this point. Deletion mutant mNGFA2-8 which has exactly the same number of residues in the 
amino terminus as the highly active mNGFA3-9 mutant is also very active (Drinkwater et a/., 

1993) . Some decrease in biological activity of mNGFA2-8 is likely to be caused by 
unfavorable directing of the Met 9 hydrophobic side chain away from the complex whereas in 
the mNGFA3-9 deletion mutant this side chain is replaced by the hydrophilic hydroxyl group 

25 (Figure 4). Deletion mutants mNGFA3-8 (Drinkwater et a/., 1993) and hNGFA1-5 (Shih et aL, 

1994) , which contain a greater number of amino terminus residues, are remarkably less 
active than wild type NGF because the remainder of the loop is too long to occupy the 
"hollow 1 ' but too short to separate from the rigid region. The present explanations for the 
observed structure-activity relationships enable the prediction that, for example, a S2G 

30 mutation in the mNGFA3-9 deletion mutant would retain high biological activity. 

Although the bioactive conformation of the NGF binding determinant has been 
disclosed above, the question of which NGF residues are involved in NGF/TrkA recognition 
was not yet addressed. One would reason, most probably, those residues which are primarily 
(responsible for maintaining the particular structure of the NGF termini do not directly interact 

35 with the receptor. Thus, side chains of His\ Pro 5 , His 8 , Met 9 , Arg 9 , Glu 11 and Arg 118 are not 

expected to play a crucial role in molecular recognition. On the other hand, charged residues 
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Arg 114 " and Lys 115 ' which are conserved in NGF molecules across all species (McDonald etaL t 
1991) but not involved in principal intramolecular electrostatic interactions are candidates for 
the centres of intramolecular electrostatic recognition. Although both Arg 114 and Lys 115 * 
participate in stabilizing intramolecular H-bonding within the bioactive conformations of the 
NGF termini, their electrostatic properties as charged residues are not utilized for this 
purpose. 

The other structural element of the termini with potential for stereochemical fit to the 
TrkA receptor is a short exposed (J-sheet motif located between the a-carbons of residues 7 
and 9 (Figures 2,3). On the one hand, H-bonding between fJ-pleated sheets is rather 
common for protein-protein interactions (Kobe and Deisenhofer, 1993, 1994 ( 1995; Yoder et 
a/., 1993; Baumann et a/., 1993). On the other hand, a short exposed (J-sheet motif is 
generally inherent in the structure of a leucine-rich repeat (Kobe and Deisenhofer, 1993, 
1994, 1995) which has been recently identified as the TrkA recognition site for NGF 
(Windisch et a/., 1995a, 1995b). Other regions of NGF might also be involved in the 
NGF/TrkA recognition (Shih et a/., 1994; Ibanez et a/., 1993). This will be discussed in depth 
below. 

CONCLUSIONS 

The Variable Basis Monte Carlo simulated annealing technique disclosed herein 
enables analysis of the low-energy conformational space of peptide molecules. Although 
multiple VBMC calculations do not consistently converge to the global energy minimum 
structure, they do permit the biologically active conformation to be evaluated. Definite 
physical factors maintain and distinguish the bioactive conformation of a molecule, and these 
factors can be understood by analyzing the low-energy conformational space. The precisely 
defined global energy minimum structure is unnecessary, especially considering that any 
empirical force field functions are not ideal. In biologically relevant molecular modelling, 
exploring the bioactive conformation is more important than determining the global energy 
minimum. VBMC is a useful tool for facilitating this exploration. 

The present VBMC simulations predict that the amino (1-1 1) and carboxyl (112-118') 
termini of dimeric (1-118)NGF form a complex comprised of a rigid region and a 
conformational^ variable loop. The rigid region is formed by residues 9-1 1 and 1 12'-1 18* 
while the variable loop is formed by residues 1-8. The major stabilizing factor of the rigid 
region is the ionic contact of the Glu 11 and Arg 11 * side chains. The separation of the loop 
from the rigid region arises from the electrostatic repulsion between His 4 and boundary 
residues of the rigid region His 8 , Arg 9 (in hNGF) and Arg 116 *. The geometry of the complex 
explains observed structure-activity relationships. Accordingly, it is the conformation of the 
rigid region which is recognized by the TrkA receptor, defining the biologically active 
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conformation of NGF for TrkA activation. 

The split of the complex of the NGF termini into two distinct moieties could have 
important biological implications. Although only the structure of the rigid region is recognized 
by the TrkA receptor, the flexible loop is able to control conformational stability of the 
5 biologically active conformation of the rigid region and, consequently, NGF-induced TrkA 
phosphorylation. 

PART B 

THEORETICAL MODELLING OF Trk-RECEPTOR RECOGNITION SITES FOR NGF AND 
10 BDNF 
INTRODUCTION 

As mentioned above, the location of the neurotrophin binding sites within the Trk 
receptors is the subject of debate (MacDonald and Meakin, 1996). Based upon published 
data, there is reason to believe that Trk proteins have two putative neurotrophin binding sites: 

1 5 the "second immunoglobulin-like domain" (lgC2 ( residues Trp^-Asn 365 ) and the "second 
leucine-rich motif" (LRM-2A and LRM-2B, residues Thr^-Leu 120 of TrkA and TrkB, 
respectively) (Schneider and Schweiger, 1991; Kullander and Ebendal, 1994; Perez e/a/., 
1995; Urfer etaL, 1995; Windisch etal. t 1995a, 1995b, 1995c; MacDonald and Meakin, 1996; 
Ryden and Ibanez, 1996). LRM containing proteins are a diverse group with different 

20 functions and cellular locations, and are known to mediate strong and selective 

protein-protein interactions (Kobe and Deisenhofer, 1993, 1994, 1995). The amino acid 
sequence of an LRM is characterized by conserved hydrophobic consensus residues 
(Schneider et a/., 1988; Schneider and Schweiger, 1991; Kobe and Deisenhofer, 1993, 1994, 
1995). Typically, LRMs exist as repetitive cassettes, with an individual LRM representing a 

25 right-handed £-cr hairpin unit (Kobe and Deisenhofer, 1993, 1994, 1995). Recent 

crystallographic studies on porcine ribonuclease inhibitor indicate that the side chains of the 
consensus residues form the hydrophobic core of the LRM modules, thereby determining 
their geometric features (Kobe and Deisenhofer, 1993). 

According to the invention, the inventors report explicit atom-level models of the 

30 receptor sites for NGF and BDNF based on the structures of the second LRMs of TrkA (LRM- 
2A) and of TrkB (LRM-2B), respectively. To deduce these models, we have investigated the 
interaction of the bioactive conformation of the NGF amino and carboxyl termini region with 
the LRM-2A and of the "waist" region of BDNF with the LRM-2B. A series of four active NGF 
analogues, namely human NGF (hNGF) (Burton et a/., 1992; Shih et a/., 1994), mous© NGF 

35 (mNGF) (Burton et ©A, 1992; Drinkwater et a/., 1993), the 3-9 deletion of mNGF (mNGFA3-9) 
(Drinkwater et a/., 1993) and the hNGF:NT-4/5 heterodimer (Treanor et a/., 1995), have been 
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docked to the LRM-2A of rat TrkA. For all four NGF molecules studied, a stereochemical fit of 
the LRM-2A to the amino and carboxyl region was achieved. These results indicate that the 
bioactive conformation of the NGF termini is a functional epitope which mediates high affinity 
binding via an interaction with the LRM-2A. This approach has also been applied to 
BDNF/LRM-2B docking, and their stereochemical fit, based mainly on electrostatic 
complementarity, has been identified and is described below. 



METHODS 



15 



10 



A sequence of three logical steps has been used to deduce the models of the 
interactions of NGF and BDNF with their respective Trk receptor binding sites: Step 1: The 
bioactive conformations of the amino and carboxyl termini of NGF analogues were 
determined. Step 2: The residues of the Trk proteins which form potential binding sites for 
NGF and BDNF were identified. Step 3: The interaction of the bioactive conformation of the 
NGF amino and carboxyl termini with the TrkA binding site and the BDNF P-strand stem 
region with the TrkB binding site were modeled computationally. Finally, the predicted 
features of the NGF/TrkA and BDNF/TrkB interaction models were validated against available 



experimental data. Residues of the first monomer of a neurotrophin are designated 1 to 1 18, 
those of the second monomer are designated 1' to 118'. Residues of the TrkA LRM-2A and 
the TrkB LRM-2B binding sites are designated by abbreviations tA and tB, respectively. 
Residues of the BDNF monomer are designated 1 to 1 19. 

20 Step 1: REVEALING THE BIOACTIVE CONFORMATION OF THE NGF TERMINI 

The most energetically favored conformations of the amino and carboxyl termini of 
NGF analogues have been obtained by VBMC computations and are described above. Since 
the 3-dimensional structures of the termini of fully active NGF molecules were found to be 
different from those of TrkA-inactive analogues, the former conformations were predicted to 

25 be "bioactive" for TrkA-mediated activity of NGF. That is, those geometric features that are 
exclusively inherent to active NGF molecules are expected to be compatible with the structure 
of the TrkA binding site for NGF. Since the geometry of the central (J-strand stem of BDNF is 
rigid and has been established by X-ray crystallography (Radziejewski and Robinson, 1993), 
this logical step is bypassed for BDNF. Instead, it is assumed that the X-ray crystallography 

30 deduced conformation of the amino acid backbone of the TrkB binding determinant of BDNF 
represents its "bioactive" conformation. 

Step 2: SELECTION OF THE LRM-2 BOUNDARIES WITHIN THE TRK RECEPTORS 

Since there is controversy in the literature regarding the actual boundaries of LRMs 

(Schneider et a/., 1988; Schneider and Sefow^iger, 1991; Kobe and Deisenhofer, 1993, 1994, 
35 1995; Windisch et a/., 1995a, 1995b), the following experimental background (Figure 8) was 



used to define boundaries for this computational study. According to Schneider and 
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colleagues (1988, 1991), the LRM domains of TrkA begin with Leu MtA , Leu 93 " and Leu 117 * 
whereas those boundaries are slightly shifted in the crystallographic studies of Kobe and 
Deisenhofer (1993). Furthermore, the boundaries of the "minimal" LRM module of TrkA 
(designated B4 in Figure 8), which demonstrates the same NGF binding affinity as wild type 
5 TrkA, are substantially shifted towards the carboxyl terminus (Windisch et a/., 1995a, 1995b) 
with respect to those suggested by Schneider and colleagues (1988, 1991). The same 
boundary shift has been suggested for the LRM-2B module (Windisch et a/., 1995a, 1995b, 
1995c). However, a comparison of active (B1 and B4) and inactive (N4) LRM domains 
indicates that the presence of the "minimal" amino acid sequence (B4) does not ensure NGF 
1 0 binding. Understanding those factors which maintain the essential features of the LRM 
geometry in segments B1 and B4, and those factors which destroy them in N4, is of 
fundamental relevance to the correct assignment of the LRM boundaries. 

p-Strand motifs exist in proteins only if hydrogen bonded (H-bonded) to other 
P-strands; otherwise their structure is unstable (Kobe and Deisenhofer, 1994). Tandem 
15 repeats of LRMs represent a prominent example of H-bonded parallel P-strand motifs. The 
general location of the p-strand motifs in the LRM, according to X-ray data, is illustrated in 
Figure 8; however, in a single LRM their particular geometry is destroyed (Kobe and 
Deisenhofer, 1993, 1994, 1995). Consequently, the only possibe way to exclude dramatic 
conformational changes in a protein domain upon disruption of the inter-strand H-bonding 
20 between p-strand motifs is to avoid those motifs. Accordingly, both boundaries of segment 
B4 are located in the middle of the p-strand regions of the adjacent LRMs such that this 
segment appears to be free of any P-strand. Nevertheless, the p-strand motif is a 
characteristic structural unit of LRMs, and thus it is likely that this unit is involved in the 
NGF/TrkA recognition process. Additionally, a short exposed p-strand motif has been 
25 detected in the bioactive conformation of the NGF termini, as described above. The 

incorporation of the P-strand regions at both sides of the B4 segment (as in N4) results in 
cooperative H-bonding between these p-strands and a complete loss of NGF binding, which 
indicates that accessibility of the p-strand motif for intermolecular contacts is essential. On 
the other hand, the separation of the entire LRM segment from the adjacent LRMs within the 
30 B1 molecule as a result of conformational kinking is likely to be energetically more favorable, 
since the entire LRM module has the internal stabilizing factors. Moreover, kinking between 
adjacent LRMs does not disrupt the structure of their p-strand motifs (since they remain 
H-bonded with their opposite sides), but makes them accessible for molecular recognition 
processes. 

35 Accordingly, in the modelling of the present invention we used the conventional 

boundaries of the LRMs of TrkA and TrkB (Schneider et a/., 1988; Schneider and Schweiger, 
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1991), which keep the P-strand motif in the LRM, and we explicitly arranged H-bonding of this 
unit with the parallel P-strand motif of the NGF amino terminus to preserve it from 
conformational reorganizations. Thus, the segments between residues Leu 93lA and Leu 117lA in 
TrkA, and Leu 93 * 8 and Leu 117lB in TrkB, were respectively designated the LRM-2A and the 
5 LRM-2B. 

Step 3: COMPUTATIONAL SIMULATION OF THE NEUROTROPHIN/RECEPTOR 
INTERACTIONS 

Docking interactions NGF/LRM-2A and BDNF/LRM-2B were simulated using a 
combined VBM C/C HARM M approach. As described above, the VBMC algorithm permits 
10 conformational space search; CHARMM (Brooks et a/., 1983) is a force field which defines 
conformational energy. The same approach was also used to study improper docking 
interactions of the central p-strand stem region of NGF with the LRM-2B, as a negative 
control. 

The VBMC algorithm performs conformational and orientational space searches of a 

15 complicated molecular system with the aim of finding the structure which corresponds to the 
global minimum on the potential energy Et ot surface. 

E tot = E, + E„ + E e + E hb + E^ + E hy + E c (4) 
The following potential energy terms have been considered in the framework of the 
united atom CHARMM force field (Brooks et at., 1983; The force field parameters are 

20 extracted from CHARMM version 23.1, Molecular Simulations, Inc., Waltham, MA): torsional 
distortion E„ van der Waals interactions E w , electrostatic interactions E e , and hydrogen 
bonding E hb . Since the united atom approach is not appropriate to treat rigorously the 
quadrupole-quadrupole interactions of benzene rings (which are important for determining 
their mutual orientations in proteins (Hunter and Sanders, 1990; Hunter et a/., 1991), single 

25 quadrupoles have been placed at the centers of the benzene rings of phenylalanine, 

tryptophan and tyrosine, and the energy of the quadrupole-quadrupole interactions, E qq , has 
been calculated as suggested by Schauer and Bernstein (1985). (The potential energy 
function of two interacting benzene molecules, comprising E^ and E w terms, has stationary 
points corresponding to both known electrostatically favorable "edge-to-face" and "offset 

30 stacked" orientations, with the dimerization energy of 1.94 kcal/mol agreeing well with ab initio 
quantum mechanical calculations (Hobza etal. % 1994).) In addition, the conformational^ 
dependent free energy of hydration, E^, has been calculated according to the hydration shell 
model including both nonspecific and specific hydration terms (Hopfinger, 1973; Hodes et aA, 
1979). Point atomic charges of Weimer and coISeagues (1984) have been used. A dielectric 

35 constant equal to the interatomic separation on angstroms has been used to implicitly simulate 
solvation effects (Weiner et a/., 1984). 
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The E c term is used only for the NGF/LRM-2A complex and represents the distance 
constraints which restrain the H-bonding of residues Glu 11 and Arg 118 within the bioactive 
conformation of the NGF termini, as described above. In addition, two intermolecular H- 
bonds, arising from the alignment of complementary parallel p-strand motifs of NGF and 
5 LRM-2A, are also constrained and contribute to E... In the Monte Carlo conformational space 
search, E e is defined as follows: 

E c =10* £ (d^-1.8) 2 . {5 ) 
where d AB is the distance in angstroms between atoms A and B which are to be H-bonded. 
The E e term is neglected when performing final energy minimization following the VBMC 
1 0 simulated annealing. In addition, this term has not been utilized when studying the 

mNGFA3-9/LRM-2A complex. Since, as described above, the mNGFA3-9 deletion mutant 
does not have a P-strand region, the geometry of its LRM-2A counterpart, namely 
Leu^-lle 9 **, has been kept fixed in the crystallographic conformation (Kobe and 
Deisenhofer, 1993) during Monte Carlo simulations. The E c term, being "soft", does not 
1 5 preclude conformational and orientational changes to occur during Monte Carlo simulations. 
This term increases rigidity of the bioactive conformation of the ligand, which allows us to 
utilize the knowledge obtained at Step 1, instead of explicit considerations of inactive NGF 
analogues, in Step 3. In addition, this term keeps conformations of the p-strands of both 
ligand and binding site from destruction, and thereby allows us to avoid an explicit 
20 consideration of the adjacent first LRM of TrkA. Furthermore, this term maintains the known 
orientation of the parallel 0-strand motifs in the NGF/LRM-2A complex. 

The amino acid sequences of the binding determinants of the NGF molecules studied 
are presented in Figure 9. The conformational space of the NGF analogues consists of 
torsional angles determining geometry of the amino terminus of one monomer (residues 1-11) 
25 and the carboxyl terminus of the other (residues 1 12'-1 18'). while keeping the rest of the 
dimer in the experimental geometry of mNGF (McDonald et a/., 1991). The conformational 
spaces of the LRM domains of TrkA and TrkB consist of torsional angles representing 
fragments Arg^-Leu 11 ™ and Arg^-Leu 117 " 3 , respectively. Covalent bond lengths and bond 
angles have been kept fixed. The geometry of each i-th residue is determined only by flexible 
30 torsional angles of the backbone (<}>, and ip,) and the side chain (yj . x*2,...). All torsional 

angles for prolines, except \\>, have been fixed. Torsional angles describing structural motifs 
with planar geometries (such as peptide bonds or conjugated regions in side chains) were 
fixed. Likewise, the configurations of the sp 3 -carbons were fixed. The conformational space 
of BDNF consists only of torsional angles determining side-chain geometries of those 
35 residues which are located in the vicinity of its central p-strand stem and define a molecular 
surface (Table 2). In addition to the conformational variables of each complex, the six 
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orientational variables (three Euler angles and three translations along the coordinate axes) 
were used to describe the Ugand-receptor orientations. 

In accordance generally with the simulated annealing approach (Kirkpatrick et a/. ( 
1983; Vanderbilt and Louie, 1984; Brunger, 1998; Nilges et a/., 1988; Wilson e/a/., 1988; 
5 Wilson and Doniach, 1989), the molecular systems were kept at a constant temperature of 
800°K for 2x1 0 6 Monte Carlo moves before the initiation of "cooling". The temperature was 
then decreased at every Monte Carlo move by a small increment such that after 2*10 6 Monte 
Carlo moves it reached 120°K. At this time, a local energy refinement was performed to 
identify the nearest stationary point on the potential energy surface. According to the VBMC 

10 algorithm described above, the basis of independent structural parameters was recalculated 
after every 7*10 4 Monte Carlo moves to direct them along the valleys of the potential energy 
surface, thereby accelerating conformational and orientational changes. 

All calculations were performed on an lndigo-2 Silicon Graphics Workstation and an 8- 
node IBM Scalable POWER parallel 2 [SP2] high performance computer. 

1 5 RESULTS AND DISCUSSION 

CONFORMATIONAL FLEXIBILITY WITHIN RECEPTOR ENVIRONMENTS 

Five independent VBMC simulations from different initial configurations were 
performed for each of the six molecular complexes under study. The elimination of the 
distance constraints E c in the complexes with the LRM-2A did not result in any noticeable 

20 conformational changes within the complexes. Although independent VBMC simulations did 
not always converge to the same geometry for the same molecule, the final structures 
differed insignificantly; i.e., the conformations of the peptide backbones remained unchanged 
with the exception of residues 1-3 of the NGF amino terminus and the terminal residue 
Leu 117tA of the LRM-2A. The differences between the VBMC generated structures were 

25 located in side-chain conformations of a limited number of residues, specifically Ser\ Val 117 ' 
(Thr 117 in mNGF), Thr 57 * and Ser 101tA . These residues are not involved in intermolecular or 
intramolecular interactions, and do not affect the overall structures of the complexes. No 
considerable differences were identified in the independent VBMC-generated conformations 
of the BDNF/LRM-2B and the NGF/LRM-2B complexes. 

30 A grid search of the internal space of the complexes using the water molecule radius 

1 .4 A as a probe (Ponnuswamy and Manavalan, 1976) (as incorporated into the QUANTA 
program, Version 4.1, Molecular Simulations, Inc., Waltham, MA) showed that there is no 
internal cavity available for a solvent molecule in the hNGF/LRM-2A complex. This explains 
the rigidity of the complex snd demonstrates the "perfect" stereochemical fit of the hNGF 
. 35 aminocarboxyl termini complex to the LRM-2A. Other complexes have some internal 
cavities, especially with the LRM-2B binding site. 
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SPECIFIC BINDING AREAS WITHIN RECEPTOR ENVIRONMENTS 
The TrkA Receptor Binding Site 

Comparison of the most stable structures of the different complexes with the LRM-2A 
binding site (Figure 10) shows their striking conformational similarity. In all molecules, 
5 ligand-receptor binding is caused by similar hydrophobic, ionic, H-bonding and van der Waals 
interactions. There are five distinct areas (A-E) within the NGF/LRM-2A fit: areas A and B 
have hydrophobic interactions, areas C and D have ionic H-bonding, and area E has 
cooperative H-bonding between parallel p-strand motifs. 

Area A represents the hydrophobic region at which residue Phe 105lA interacts with 
10 Trp 76 *, which in turn forms hydrophobic contacts with Phe 12 and Leu 112 . The aromatic rings of 
Phe 12 , Trp 76 " and Phe 105tA are approximately perpendicular to each other, which is energetically 
favorable (Hunter and Sanders, 1990; Hunter et a/., 1991; Hobza etal., 1994). This particular 
geometry would create a hole within the complex, if it was not filled by hydrophobic residue 
Leu 117 . 

15 in area B, Phe 7 of NGF is placed within the hydrophobic core of the LRM-2A created 

by the consensus residues (Schneider et a/., 1988; Schneider and Schweiger, 1991; Kobe 
and Deisenhofer, 1993, 1994, 1995). Similar to the favorable orientation of the aromatic 
residues in area A, the aromatic rings of Phe 111tA and Phe 113tA are perpendicular to that of 
Phe 7 . On the other hand, residues Phe 111tA and Phe 113lA are not perpendicular to each other; 

20 their orientation instead resembles the favorable "offset stacked" geometry described by 

Hunter and others (Hunter and Sanders, 1990; Hunter et a/., 1991; Hobza et a/., 1994). As in 
area A, a hole created by the three neighboring favorably oriented aromatic rings is filled by a 
fourth residue, Thr 114lA . Interactions between aromatic rings in proteins is very common and 
represent an important conformational driving force (Warme and Morgan, 1978; Hunter et a/., 

25 1991). Consequently, the fact that all three phenylalanine residues of the LRM-2A find their 
counterparts within the NGF binding domain supports the calculated structure of the 
NGF/LRM-2A complex. 

In ionic binding area C, the negatively charged residue Asp 109tA exerts a central role. It 
is surrounded by the three positively charged amino acids His 112tA , Arg 114 ' and Lys 115 in hNGF, 

30 mNGF and mNGFA3-9, and by His 112tA , Arg 115 and Arg 9 in hNGF:NT-4/5. Residue His 112lA 
plays a secondary role: it creates the electrostatic dipole Asp 109tA -»-His 112tA in the LRM-2A, 
which imposes constraints on the spatial positions of the recognized positively charged 
residues of NGF, thereby reinforcing geometric selectivity in this area. Although the 
hNGF:NT=4/5 heterodimer has two substitutions in the C-arsa segment of the NT-4/5 catr&oxyl 

35 terminus, namely R1 14S and K1 15R, the geometry of the C area remains qualitatively the 
same, since residues Arg 9 and Arg 115 ' of hNGF:NT-4/5 functionally replace the absent Lys 115 
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and Arg 114 amino acids, respectively. Although Arg 9 also exists in the hNGF homodimer, it is 
not involved in the ionic binding in this molecule, since it is separated from area C due to 
electrostatic repulsions. Similarly, the charged amino end of mNGFA3-9, which functionally 
replaces Arg 9 , is also separated from area C. There is no place for the fourth positively 
5 charged residue in area C. Thus, in all molecules studied, the two positively charged amino 
acids of the ligand saturate the coordination possibilities of the receptor residue Asp 109tA . 

Area C is partially isolated from its surrounding environment by flanking hydrophobic 
residues VaI 106IA and Pro 115tA . Similarly, the principal charged residues of NGF, namely His 8 , 
Glu 11 and Arg 116 \ are located between hydrophobic residues lie 6 (Val 6 in mNGF), Val 111 (Leu 113 

10 in hNGF:NT-4/5) and Val 99 * within the complexes. Thus, residues Val 99 *, Val 106lA and Pro 115lA 
of the LRM-2A participate in "compartmentalization" of the principal ionic interactions of the 
complexes, thereby shielding them from the conformation destructive influence of solvation. . 

The organization of ionic binding area D is similar to that of area C. Positively 
charged residues Arg 59 and Arg 69 are located in the vicinity of negatively charged Asp 16 within 

1 5 the NGF structure (McDonald et a/., 1991 ; Bradshaw et ai , 1994), such that the coordination 
of the latter is not full. The third positively charged residue, Lys 1001 *, comes from the LRM-2A 
to complete the Asp 16 coordination. In addition to the ionic binding, residue Lys 100lA forms an 
H-bond with the Ser 13 side chain. 

Area E consists of the H-bonded parallel p-strand motifs of the LRM-2A and NGF 

20 (Figure 10b, 10c and 10e), each of the p-strands having three peptide bonds. The LRM-2A 
p-strand motif is located between the ct-carbon atoms of Asn 96 * and lie 98 *, whereas the NGF 
P-strand is located between the a-carbons of the sixth and ninth residues. The locations of 
the P-strand regions in the uncomplexed LRM (Kobe and Deisenhofer, 1993) and NGF (see 
above) resemble those in the complexes considered. In the complexed LRM-2A, the p-strand 

25 motif is shortened by one peptide bond at its amino end to accommodate the His 4 side chain 
of NGF. On the other hand, the two peptide bond p-strand of the uncomplexed NGF is 
lengthened by one peptide bond at its amino end in the complexes. 
The TrkB Receptor Binding Site 

Figure 11a illustrates the central p-strand stem region of BDNF, and Figure 11b 

30 presents the VBMC-generated structure of the BDNF/LRM-2B complex. In this complex, 

iigand-receptor binding is based mainly on intermolecular electrostatic interactions of either 
ionic or hydrogen bonding origin. There are five distinct binding areas (F-K) within the TrkB 
receptor environment: four salt bridges (areas F, G, H and I) and one H-bonding area K. In 
contrast with the NGF/LRW1-2A binding pattern, there is no participation of the LRM-2B p- 

35 strand motif in cooperative H-bonding with the BDNF binding ©pitope. 

In area F, the primary positively charged residue (Ibanez et ai, 1993) of the BDNF 
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binding domain, Arg 81 t is bound to Asp 10aB t the only negatively charged residue of the LRM- 
2B. It should be noted that Arg 81 is exclusively inherent in BDNF molecules while being 
substituted by electrostatically neutral Thr in NGF or by Lys in NT-3 or NT-4/5 molecules 
(Bradshaw et a/., 1994), which indicates the importance of this area in neurotrophin 
5 recognition processes (see also Urfer et eA, 1994). On the contrary, residue Gin 84 , identified 
by Ibanez and co-workers (1993) as important, is not directly involved in specific 
intermolecular interactions within the LRM-2B domain. 

In binding area G, the negatively charged residue Glu 55 , which is absolutely conserved 
in all neurotrophins (Bradshaw et a/., 1994), exerts a central role. It is surrounded by the 

1 0 three positively charged amino acids, Lys 25 , Lys 57 and Lys 109tB . Two of them, belonging to the 
neurotrophin, are also highly conserved (Bradshaw et a/. , 1994). The third one, Lys 109tB , is 
specific for the TrkB receptor site (Windisch et a/. , 1995a, 1995b, 1995c). All BDNF residues 
of the G binding area, namely Lys 25 , Lys 57 and Glu 55 , have been shown to be essential in NT-3 
structure for TrkC receptor activation (Urfer et a/., 1994), which is in agreement with 

1 5 observations that the TrkB and TrkC binding domains are located in the same region of these 
neurotrophins. This particular arrangement of the three positively charged residues around 
one with a negative charge seems to be very common in neurotrophin-receptor recognition 
processes, since the NGF/LRM-2A interaction also includes two binding areas of this type (C 
and D). Existence of the absolutely conserved negatively charged residue Asp 106 in direct 

20 proximity to area G increases binding affinity of Lys 109tB , thereby highlighting some importance 
of this area in the BDNF/TrkB recognition processes. Although detailed mutational 
experiments did not demonstrate importance of the equivalent residue Asp 105 in NT-3 for TrkC 
related activities (Urfer et a/., 1994), its direct involvement in the Zn 2 * mediated regulation of 
neurotrophin bioactivities has been recently proven by multidisciplinary studies (Ross et a/., 

25 1997), and, therefore, its location within the binding determinants of BDNF and NT-3 is 
consistent with these observations. 

In area H, conserved residue Asp 24 of BDNF forms a salt bridge with specific TrkB 
residue Lys n3tB . Although Asp 24 has not been demonstrated to be involved in direct 
neurotrophin-receptor interactions, either for TrkB or TrkC related activities, it is highly 

30 conserved in neurotrophins (Bradshaw ef a/. , 1994) and located in direct proximity to Thr 22 in 
NT-3, which has been shown to be included in the TrkC receptor binding domain (Urfer et a/., 
1994). These observations are consistent with the present model of BDNF/TrkB binding, 
since substitution of the equivalent residue of BDNF, Thr 21 , by bulky Gin (Urfer et a/., 1994) 
would statically hinder proper location off the LRM-2B domain (Figure 11b), whereas mutation 

35 D24A, although lacking one specific ligand-receptor interaction, would not significantly alter 
the BDNF/LRM-2B orientation resulting from multiple-point electrostatic complementarity. In 
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addition, the absence of negatively charged residue Asp 24 is shown to induce salt bridge 
interaction of Lys 113tB with neighboring Glu 16 t with only minor conformational alterations of the 
LRM-2B. 

In area 1, Glu 18 , a specific residue of BDNF (Bradshaw et a/., 1994), forms a salt 
5 bridge with the LRM-2B amino acid Arg 9418 Residue Glu 18 is also involved in intramolecular 
electrostatic interactions with Lys 57 . 

Area K is the only region of the distinct BDNF/LRM-2B interactions which is not of 
ionic nature. The positively charged residue Lys 104tB forms multiple H-bond contacts with the 
BDNF loop I backbone cartonyl groups and the neutral Thr 35 side chain. The energy of such 

10 H-bonding is much lower than that of salt bridge contact, and, therefore, a substitution of the 
uncharged amino acid Thr 35 by a negatively charged residue would stabilize the particular 
structure of the BDNF/LRM-2B complex. In accordance with the present model, the 
experimental substitution of BDNF residues 34-41 by the corresponding segment of NGF, 
which includes the substitution of negatively charged residue Glu 35 for Thr 35 , resulted in a 

1 5 significant increase in bioactivity of the resulting chimeric molecule relative to wild type BDNF 
in TrkB mediated survival (Lai et a/., 1996). 

In addition to these five distinct binding areas of the BDNF/LRM-2B complex, there is 
one more area which reinforce binding specificity. The side chain of Pro 60 , a specific residue 
of BDNF (Bradshaw et a/., 1994), demonstrates a perfect geometric fit into the hollow in the 

20 "zigzag"-shaped p-strand region of the LRM-2B (Figure 11b). This geometric 

complementarity indicates that Pro 60 "recognizes" the p-strand motif of the receptor binding 
site. This interaction, obviously, does not significantly contribute to the total energy of the 
complex in the case of a perfect fit, but does represent a steric hindrance when this fit cannot 
be readily achieved. Accordingly, BDNF is very selective and does not bind non-cognate Trk 

25 receptors. 

Figure 11c illustrates the VBMC-generated geometry of the "improper NGF/LRM-2B 
complex. In this complex, specific binding areas F and I are absent due to lack of properly 
charged residues at positions 20 and 81 , which would correspond to BDNF residues Glu 18 
and Arg 81 , respectively. In addition, residue Pro 60 , determining the proper BDNF/LRM-2B 

30 orientation, is substituted in NGF molecules (Bradshaw et a/., 1994). These substitutions 

cause dramatic conformational and orientational alterations in the complexed LRM-2B domain 
(compare Figure 11b and 11c). Predicted Cartesian 3-dimensional coordinates of atoms of 
the TrkB LRM-2B binding site in the complex with BDNF are presented in Appendix 2. 
UGAND-RECEPTOR AFTERACTION EWERGY 

35 NGF/TrkA Interactions 

Table 3 presents the components of the intermolecular energies of the NGF/TrkA 
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complexes. Stability of the NGF/LRM-2A complexes results primarily from attractive 
electrostatic and van der Waals interactions, with intermolecular H-bonding playing a 
secondary role. It should be noted, however, that in the framework of the molecular 
mechanical approach, the H-bonding energy is partially incorporated into the electrostatic 
5 interaction energy term, so the tabulated E^, values reflect only part of the H-bonding energy. 
Accordingly, intermolecular H-bonding is more important for the NGF/LRM-2A interaction than 
it appears in Table 3. On the other hand, the intermolecular component of the E qc term 
seems to be insignificant. The reason for the latter is that the aromatic moieties of Phe 7 , 
Trp 76 Phe 105tA . Phe ,,,lA and Phe 113 * are ••preorganized" for intermolecular contacts with their 
1 0 counterparts in the complexes in electrostatically attractive geometry in which the 

quadrupole-quadrupole interaction energy term is not significant. In addition, hydrophobic 
ligand-receptor interaction energy is presumably extremely important for NGF/LRM-2A 
complex formation, but rigorous quantitative evaluation is currently not possible. 

Since neither electrostatic interactions nor hydrogen bonding can significantly stabilize 
1 5 a ligand-receptor complex in a water environment, the stability of the complex generally 
results from van der Waals and hydrophobic interactions (Clackson and Wells, 1995). 
Considerable contribution of intermolecular van der Waals interaction energy (Table 3) 
arises from the geometric complementarity of the NGF binding domain and the TrkA 
recognition site, such that the high value of the term is due to a large number of 
20 intermolecular van der Waals contacts. This complementarity is a consequence of the nght- 
handed twists of both the NGF amino-carboxyl termini complex and the LRM-2A. this 
geometric feature being characteristic of each of these domains in both uncomplexed (Kobe 
and Deisenhofer. 1993; Shamovsky et a/.. 1996) and complexed conformations. Thus, purely 
geometric ligand-receptor compatibility together with distinct hydrophobic contacts play a 
25 crucial role in NGF/TrkA recognition, while attractive electrostatic interactions and H-bond.ng 
further reinforce specificity of binding, which is in agreement with generalizations made by 
Clackson and Wells (1995). 
BDNF/TrkB and NGFfTrkB Interactions 

Table 4 presents the components of the potential energies of the complexes of BDNF 
and NGF with the LRM-2B in their minimum-energy conformations. Data presented in section 
A of Table 4 indicate that the difference in total energies (92.0 kcal/mol) is almost exclusively 
a consequence of more favorable electrostatic interactions in the BDNF/LRM-2B complex 
than in the NGF/LRM-2B complex (negative control). Section B of Table 4 illustrates that the 
BDNF complex is also more favorable in terms of intermolecular electrostatic energy by 77.2 
35 kcal/mol (85% of the difference in total electrostatic energies). On the contrary, intermolecular 
van der Waals interactions are more favorable in the -improper" NGF/LRM-2B complex. 
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Furthermore, the value of intermolecular van der Waais energy is significantly lower for 
NGF/LRM-2B than for the NGF/LRM-2A complex, whereas electrostatic contribution is more 
than twice as high. Consequently, in contrast to the NGF/TrkA binding pattern, molecular 
recognition of BDNF by the TrkB receptor binding site is almost exclusively based on 
5 electrostatic complementarity, whereas the roles of van der Waals and hydrophobic 

interactions are insignificant. At the same time, such a perfect electrostatic fit could not be 
achieved without a purely geometric compatibility of the BDNF and the LRM-2B binding 
domains in size and shape (Figure 11b). 
INDUCED CONFORMATIONAL CHANGES 

10 The NGF Amino and Carboxyl Termini 

Conformations of the NGF amino and carboxyl termini remain almost the same after 
complexation with the LRM-2A (Table 5; compare Figure 10a with 10b, 10c, 10d and 10e). A 
few local conformational changes occur in the Arg 114 ' and the Lys 115> side chains, and in the 
backbones of residues Ser 2 and Phe 7 . As predicted above, conserved charged residues 

15 Arg 114 ' and Lys 115 ' do not participate in principal intermolecular interactions within the NGF 
termini but are expected to form ionic intermolecular contacts with the TrkA binding site. 
Indeed, these are the intermolecular ionic NGF/LRM-2A interactions in area C, which change 
considerably the side-chain conformations of Arg 114 ' and Lys 115 ' while keeping their backbone 
geometry intact. The LRM-2A-induced conformational changes in residue Phe 7 are caused 

20 by the intermolecular H-bonding in area E while the geometry of Ser 2 is slightly changed to 
accommodate the LRM-2A residue Thr 114tA . The rest of the structure of the LRM- 
2A-complexed NGF termini retains the energy minimum conformation of the uncomplexed 
molecules because it is controlled by multiple intramolecular H-bonds. Consequently, the 
geometry of the NGF termini is "preorganized" for complexation with the LRM-2A, and only 

25 local conformational changes, primarily in residues Arg 114 ' and Lys m , occur upon 

complexation. Geometric "preorganization" of the NGF termini as a ligand for binding to a 
leucine-rich motif is consistent with X-ray studies of Kobe and Deisenhofer (1995), who have 
demonstrated that ribonuclease A maintains a fixed geometry upon binding to the LRMs of 
ribonuclease inhibitor. 

30 The LRM-2A Domain 

Figure 12 illustrates the isolated structure of the LRM-2A from the hNGF/LRM-2A 
complex. Table 6 presents the equilibrium torsional angles of the hNGF-bound LRM-2A. As 
seen, the side chains of the general consensus residues, Leu 93 *, Leu 96 * He 96 *, Leu 10 ** and 
Phe 111tA (Schneider et bL, 11988; Schneider aind Scfaw©ig@r f 1991), form the hydrophobic core, 
. 35 similarly to their conformations in intact leucine-rich repeats (Kobe and Deisenhofer, 1993). 
Terminal consensus residue Leu 117tA is solvent-exposed due to a boundary effect. Amino 
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acids Ala 107tA , Pro 106lA , Ala 110tA and Phe 113tA are also incorporated into the LRM-2A hydrophobic 
core. On the other hand, residues Va! 99 ", Phe 105tA , Val 106tA and Pro 115 " are not involved in 
hydrophobic interaction within the core, and apparently have specific roles in the recognition 
site of TrkA. The most hydrophilic residues of the LRM-2A, namely Arg 94 " Lys 100tA , Arg 104tA , 
5 Asp 109lA , His 112lA and Arg 116IA are exposed from the hydrophobic core in the complexes. Amino 
acids Lys 100lA , Asp 109iA and His 112tA participate in specific intermolecular interactions, while the 
other three are solvent-exposed. The reasonable locations of the hydrophobic and 
hydrophilic residues of the LRM-2A within the NGF/LRM-2A complexes with respect to water 
environment supports the calculated structures. 

10 NGF/LRM-2A binding spans almost the whole LRM-2A domain, i.e., an interval 

between residues Leu 93 * and Pro 115tA comprising three distinct regions: 0-strand, a-helix, and 
loop connecting the carboxyl terminus of the (J-strand and the amino terminus of the a-helix. 
This is in agreement with recent crystallographic studies on the ribonuclease A - porcine 
ribonuclease inhibitor complex (Kobe and Deisenhofer, 1995), in which these particular 

1 5 regions of leucine-rich repeats of ribonuclease inhibitor are also involved in the molecular 
recognition process. While the (J-strand structure exists in the complexed LRM-2A, the 
a-helix motif is destroyed by intermolecular interactions with NGF. Accordingly, the LRM-2A 
(J-strand determines the proper orientation of the ligand with respect to the recognition site of 
TrkA, but does not undergo noticeable conformational changes. Consequently, the 

20 cooperative ligand-receptor H-bonding in area E cannot directly mediate those changes in 
receptor conformation which trigger receptor phosphorylation processes. Available 
experimental data confirm this point. First, the mNGFA3-9 deletion mutant, which does not 
have the [J-strand motif and, consequently, cannot specifically bind area E, is as active as 
wild type mNGF (Drinkwater et a/., 1993). Second, the 24-residue LRM fragment 

25 Thr^-Leu 120 * lacking the P-strand region (Kobe and Deisenhofer, 1993), demonstrates the 
same kinetics of binding to NGF as wild type TrkA (Windisch ef a/., 1995a, 1995b). 
The LRM-2B Domain 

Table 7 presents the equilibrium torsional angles of the LRM-2B in the complex with 
BDNF. Side chains of all general consensus residues of the LRM-2B, except for Phe 11ttB and 

30 terminal amino acid Leu 117tB ( form the hydrophobic core and thereby stabilize the 

characteristic hairpin shape of the LRM (Figure 1 1 b). In contrast with the LRM-2A, aromatic 
residues of the LRM-2B, namely Phe 105IB f Tyr 108IB and Phe 111lB , do not seem to have any 
specific roles in BDNF/LRM-2B molecular recognition. They form a cluster in which their 
aromatic rings interact with ©ach other sind are about mutually perpendicular (Figyre 11b). On 

35 the contrary, all charged residues of the LRM-2B, namely Arg^ tB , Asp 100iB , lys 104lB , Lys 10StB and 
Lys 11318 , are j nvo | veC j i n specific electrostatic interactions with BDNF. 
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Similar to the NGF/LRM-2A complex, almost the whole LRM-2B domain, from Leu 93TB 
to Lys 113tB , fits to the BDNF binding determinant (Figure 11b). On the other hand, there are 
significant differences of the neurotrophin-induced conformational alterations in the two LRM 
domains. First, since the BDNF binding domain does not have a p-strand motif which would 
5 fit to the p-strand of the LRM-2B, the characteristic geometric features of the latter are lost. 
Second, region Tyr 10stB to Asn 116lB retains its a-helical structure in the BDNF-bound LRM-2B 
domain. The reason of this conformational stability is that the spacer between the two BDNF- 
bound residues, Lys ,09lB and Lys 113lB (3 residues), is consistent with the period of an a-helix 
(3.5 residues). 

1 0 PRINCIPAL AMINO ACIDS INVOLVED IN MOLECULAR RECOGNITION 
Principal NGF Residues 

The amino acids of NGF which are primarily responsible for maintaining bioactive 
conformations --(McDonald et sL, 1931; Shamovsky era/., 1996) or which directly participate in 
NGF/TrkA recognition are specified in Figure 13. Almost all residues of the amino and 
1 5 carboxyl termini of NGF as well as the His 75 '-Asn 7r region are involved in specific intra- or 
intermolecular interactions. Only five amino acids of the 30-residue binding domain of NGF 
do not seem to be essential for either maintaining its bioactive conformation or binding to the 
recognition site of TrkA. These include Ser 1 , Ser 2 , Set 3 , Ala 11 * and Val 117 '. 

The observation that the important amino acids of the NGF binding domain are 

20 conserved across all species while the apparently unimportant five residues are variable 
(Figure 13) (McDonald et a/., 1991) supports the particular structural features of the 
NGF/LRM-2A complexes calculated herein. There are, however, exceptions which require 
explanation. In most cases, principal amino acids of NGF molecules derived from different 
species appear to be substituted by similar residues, e.g. I6V, F7L, L112I, S1 13T or R114K. 

25 in some special cases, the substituted amino acid is quite different but the particular 

substitution is still consistent with the present model, e.g. R9M, R9L or R118Q (McDonald et 
a/., 1991 ; Shamovsky et a/. ( 1996); in the case of the H8N substitution (McDonald et a/., 
1991), histidine and asparagine have the same topology with respect to H-bonding to the 
Glu 11 carboxyl oxygen. The structure of snake NGF (sNGF) requires special consideration. 

30 Principal hydrophobic residues Phe 7 and Phe 12 are replaced by histidine residues in sNGF. 

Although these mutations could be accompanied by structural differences in snake TrkA ( they 
are consistent with the present model. According to the survey of Warme and Morgan 
(1978), side-chain interactions of histidine with tryptophan or threonine are very common in 
natural proteins, which implies the existence of specific inter-residue affinities. This being the 

35 case, attractive interactions His 7 ..Thr 114tA and His 12 .,.Trp 76 " in areas B and A, respectively, 
may replace corresponding hydrophobic interactions with Phe 7 and Phe 12 . 
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Principal LRM-2A Residues 

The principal residues of the LRM-2A are shown in Figure 14. The general consensus 
residues (Schneider et aA, 1988; Schneider and Schweiger, 1991) maintain the specific 
geometry of the LRM (Kobe and Deisenhofer, 1993, 1994, 1995). The amino acids which 
5 form the distinct binding areas are involved in specific intermolecular interactions with the 

NGF binding domain. Since NGF does not activate the TrkB receptor, those residues which 
are specific for the TrkA receptor have been postulated to be directly involved in NGF/LRM- 
2A recognition (Windisch etal. % 1995a, 1995b). As shown, the LRM-2A segment has nine 
specific residues, seven of them directly participating in NGF/LRM-2A recognition in the 

10 framework of the present model. Thus, Lys 100IA binds Asp 16 in area D, Pro 108tA is completely 
buried inside the NGF/LRM-2A complex, residues Asp 109tA , His 112lA and Pro 115lA form binding 
area C, and residues Phe 113lA and Thr 114tA are involved in forming area B. The other specific 
residues, Arg 1wtA and Arg 116lA , do not seem to be required for specific NGF/LRM-2A 
interactions. It is seen, furthermore, that binding areas A and E are not specific for the 

1 5 recognition site of TrkA, indicating that binding specificity of the LRM-2A is determined by 
Lys iooiA Asp io9tA and phe 113tA . The fact that these particular amino acids are replaced in the 
recognition site of TrkB by residues of a different chemical nature, namely Asp 100lA , Lys 109tA 
and Lys 113IA , respectively (Windisch et a/., 1995a, 1995b), making NGF/TrkB binding 
impossible, supports the herein predicted details of NGF/LRM-2A recognition. Principal 

20 residue Phe 105tA , although common to both the TrkA and TrkB recognition sites (Windisch et 
a/., 1995a, 1995b), is also important for the NGF/TrkA recognition process, since oxidation of 
its counterpart, Trp 7F , results in a complete loss of receptor binding (Merrell et a/., 1975). 
Principal BDNF residues 

Since BDNF/TrkB molecular recognition is predicted to be primarily based on 

25 electrostatic complementarity, principal BDNF and LRM-2B residues are identified by 

analysing their contribution to the intermolecular electrostatic energy. Table 8 presents a 
comparison of individual contributions of residues of BDNF, NGF and the LRM-2B to the 
intermolecular electrostatic energy for the BDNF/LRM-2B and the NGF/LRM-2B complexes, 
with cutoff being ±10 kcal/mol. BDNF/T rkB molecular recognition is achieved by those 

30 residues that make significantly more favourable contacts with their counterparts than do 

corresponding residues in the "improper NGF/LRM-2B interaction. Accordingly, in section A 
of Table 8, residues of BDNF that make more favourable contacts with the LRM-2B than do 
those of NGF by at least 10 kcal/mol, are highlighted. They are: Glu 18 , Asp 24 , Glu 55 , Arg 81 and 
Gin 8 *. These residues make up the BDNF binding epitope for TrkB activation. Only two of 

35 them, Arg 81 and Gin 84 , have been previously identified (Ibanez et ©/., 1993). The most 

unusual case is residue Gin 84 , which does not markedly participate in electrostatic interactions 
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with the LRM-2B; however, its substitution by His M results in considerable repulsive 
interactions with positively charged residues of the receptor site. Therefore, Gin 64 cannot be 
referred to as a principal residue by itself in terms of the TrkB receptor binding, and its 
apparent importance (Ibanez et a/., 1993) comes from unfavorable substitution by a positively 
5 charged His w . The only residue of NGF which binds to the TrkB receptor site significantly 
more strongly than does the corresponding residue of BDNF is Glu 35 , in agreement with 
observations (Lai et a/., 1996). 
Principal LRM-2B residues 

Section B of Table 8 identifies the LRM-2B residues which significantly contribute to 

10 specificity of binding. Amino acids Arg 94 " 3 , Asp 10018 , Ser 10ltB , Lys 109tB and Lys 113tB make 

significantly more favorable contacts with their counterparts in BDNF than with those in NGF. 
Additionally, these residues are responsible for specific binding in areas F-K. The only 
residue of the LRM-2B that favours the NGF counterpart is Lys 1MtB , which is caused by the 
abovementioned effect of Glu 35 . 

1 5 Figure 15 illustrates schematically the geometry of the LRM-2A and the LRM-2B 

receptor sites. Those residues which are predicted to form specific bonds with their preferred 
ligand are shown by arrows. The similarity of the binding patterns of the two LRMs is striking, 
especially bearing in mind the differences pointed out in the nature of the intermolecular 
interactions, and that the binding patterns were identified in independent computations. 

20 Thus, binding specificity of TrkA and TrkB is determined by the residues located in distinct 

positions of their second LRMs, with their respective binding patterns being incompatible with 
each other. 

It is shown that van der Waals and hydrophobic interactions play a more important 
role in TrkA than in TrkB recognition. As a part of this phenomenon, we have demonstrated 

25 that NGF/LRM-2A binding is characterized by greater geometric complementarity, such that 
the ligand-receptor complex does not contain marked internal cavities. A plausible 
explanation for these particular features of Trk receptor recognition is that the NGF binding 
determinant is flexible and, because of that, more is required to stabilize its bioactive 
conformation in solution. Accordingly, TrkA receptor binding involves physical factors that are 

30 more effective in an aqueous environment, namely van der Waals and hydrophobic 
interactions. 

EXPLANATION FOR STRUCTURE-ACTIVITY RELATIONSHIPS 

TrlcA mediated activity has been evaluated for 27 point and deletion mutants of the 

TrkA binding domain of NGF: foNGF (Burton otaL, 1992; Shih ©fa/., 1994), mNGF (Burton et 
35 a/., 1992; Drinkwater et al. t 1993), mNGFA3-9 (Drinkwater et ai , 1993), (1-120)h6MGF 

(Schmelzeref a/., 1992), (1-120)mNGF (Luo and Neet, 1992), (1-117)hNGF (Schmelzer et a/., 

41 



BNSDOCID: <WO. 



9B06048A2J_> 



WO 98/06048 



PCT/CA97/00539 



1992) and hNGF:NT-4/5 (Treanor e/a/., 1995) retain full activity; mNGFA2-8. mNGFA61-66 
(Drinkwater ef a/., 1993), hNGF-R9A, hNGFA1-5 and hNGF-P5A (Shih et a/., 1994) 
demonstrate intermediate potency; hNGFA1-9 (Burton et a/., 1992; Kahle etat., 1992; Luo 
and Neet, 1992; Drinkwater et a/., 1993; Shih et at., 1994; Burton et a/., 1995), mNGFAl-8 
(Taylor et ai. t 1991; Burton etai. % 1992), mNGFA1-9, mNGFA9-13 (Woo ef a/., 1995), 
mNGFA3-8, mNGFA3-11, mNGFA3-12, mNGFA3-13, mNGFA3-14, mNGFA9-14, 
mNGFA112-118 (Drinkwater et ai , 1993), hNGF-H4A, hNGF-H4D (Shih efa/., 1994), BDNF 
(Suter et a/., 1992), and NT-4/5 (Treanor et a/., 1995) are almost or completely inactive The 
present model of NGF/LRM-2A recognition enables a comprehensive explanation for the 
structure-activity relationships. 

As demonstrated, fully active hNGF, mNGF and hNGF:NT-4/5 fit to the five binding 
areas A-E of the recognition site of TrkA, while the mNGFA3-9 deletion mutant fits only to A, 
C and D, without any specific binding taking place in areas B or E. Therefore, there are three 
common geometric features of the four fully active molecules which distinguish them from the 
less active analogues. First, conformations of the amino and carboxyl termini of these NGF 
analogues are "preorganized" for binding to the principal LRM-2A binding areas A, C and D. 
Second, there is no stereochemical incompatibility in the binding areas B and E Third, there 
is no hole inside the NGF/LRM-2A complexes. 

The structures of the other three fully active NGF analogues, (1-120)hNGF, 
(1-120)mNGF and (1-117)hNGF, are consistent with the above criteria. Indeed, since the 
NGF carboxyl end is located on the surface of the NGF/LRM-2A complexes (Figure 10) and 
does not directly participate in principal intemnolecular interactions, the lengthening of the 
NGF carboxyl terminus by two amino acids does not explicitly affect NGF/LRM-2A 
compatibility. As far as the (1-1 17)hNGF molecule,is concerned, terminal residue Arg" 8 * of 
NGF has been shown to form inter-chain ionic H-bonds with Glu 11 (see above) and, hence, 
seems to be important in maintaining the structure of the NGF bioactive conformation. On 
the other hand, there is a second factor stabilizing the bioactive conformation of the hNGF 
termini, namely inter-chain H-bonding of residue Arg 9 . Because of the latter binding, the 
above criteria are still fulfilled for (1-117)hNGF. 

The deletion of principal residue Phe 12 , which participates in hydrophobic interactions 
in area A, results in a substantial loss of TrkA mediated activity of the mNGFA3-12, 
mNGFA3-13 t mNGFA3-14 and mNGFA9-14 deletion mutants (Drinkwater et ai, 1993). 
Likewise, the absence of principal residues Arg 114 ' and Lys 115 , essential for the ionic binding in 
area C ( results in dramatic toss of bioaciivity of mNGFAl 12-118 (Drinkwater et a/., 1993). 
Similarly, geometric perturbations of the essential residues Arg 59 and Arg® 9 in the 
mNGFA61-66 deletion mutant result in incomplete NGF/LRM-2A compatibility in area D and, 
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accordingly, a considerable loss of activity. 

The specific location of the NGF ninth amino acid side chain in the "hollow" existing 
on the surface of the NGF rigid region has been predicted herein to be critical for high TrkA 
mediated activity. The present invention provides a detailed explanation for this prediction. 
That is, the "hollow" is buried inside the NGF/LRM-2A complex, and a moiety located in this 
"hollow" fills the cavity which otherwise would exist inside the complex and destroy NGF/LRM- 
2A compatibility. Since this moiety is located in the internal cavity of the complex (Figure 10), 
severe limitations are imposed on its size. A short side chain does not fill this cavity; 
therefore, the hNGF-R9A mutant demonstrates noticeably diminished activity (Shih et at., 
1994). The cavity preferably is filled either by a bulky amino acid side chain, such as arginine 
(e.g. hNGF), methionine (e.g. mNGF) or leucine (sNGF), or by the two residue peptide 
backbone (mNGFA3-9 or mNGFA2-8). A longer amino terminus tail (e.g. in mNGFA3-8, 
mNGFA9-13 or mNGFA1-5) cannot be accommodated in the internal cavity of the NGF/LRM- 
2A complex and, hence, interferes with binding areas B and E. 

Most deletion or point mutations of the NGF amino terminus destroy the exposed P- 
strand motif Phe 7 -Arg 9 and change the proper orientation of the Phe 7 side chain, thereby 
affecting NGF/LRM-2A compatibility in areas B and E. Thus, incorporation of a negatively 
charged residue into the relatively long NGF amino terminus, as in hNGF-H4D, BDNF, NT-3 
and NT-4/5, or substitution H8R, which is characteristic for BDNF and NT-4/5, has each been 
demonstrated to cause dramatic conformational changes in the complex of the neurotrophin 
amino-carboxyl termini, making it incompatible with the LRM-2A. This stereochemical 
incompatibility of the geometry of the NGF amino terminus with binding areas B and E of the 
LRM-2A is the reason for the partial or complete loss of TrkA-related activity of mNGFA3-8, 
mNGFA1-5, hNGF-P5A, hNGF-H4A, hNGF-H4D, mNGFA1-8, hNGFA1-9 t mNGFA1-9, 
mNGFA9-13, mNGFA3-11, BDNF and NT-4/5. 

As demonstrated, both NGF monomers are directly involved in specific ligand-receptor 
interactions, i.e., the first monomer binds areas B, D and E, whereas the second one binds 
areas A and C. Consequently, the dimeric structure of NGF is necessary to retain high 
biological activity, which is also consistent with experimental data (Maness et a/., 1994). On 
the contrary, only one BDNF protomer is essential for TrkB binding (Figure 11a, 11b). 

In addition, recent reports of experiments have demonstrated the importance of 
residues Phe 7 (Kullander and Ebendal, 1996), Leu 112 , Arg n4 and Lys 115 (Kruttgen e/a/., 1996) 
for TrkA mediated activity of NGF. This is consistent with their participation in the specific 
NGF/LRM-2A binding in th© framework of the present model. 
COWCLUSSONS 

The complexes of the amino and carboxyl termini of the four fully active NGF 
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analogues with the second leucine-rich motif of TrkA were obtained using Variable Basis 
Monte Carlo simulated annealing calculations. The same methodology was utilized to define 
the structure of the complex of the second leucine-rich motif of TrkB with the BDNF receptor 
binding determinant. The structural features of the resulting NGF/TrkA and BDNF/TrkB 
binding predictions are supported by the following: (i) geometric and electrostatic 
complementarity of the binding domains of NGF and BDNF to binding sites of TrkA and TrkB, 
respectively, without considerable conformational changes of the ligands; (ii) appropriate 
environments for hydrophobic and hydrophilic residues within the complexes; (Hi) analysis of 
structural variability of both binding domains; and (iv) comprehensive explanation of currently 
published structure-activity relationships. Within these complexes, NGF/TrkA molecular 
recognition is based upon multiple-point intermolecular interactions of hydrophobic, van der 
Waals. ionic and hydrogen bonding type. Extensive intermolecular van der Waals contacts 
arise from complementarity of the NGF binding domain and the TrkA recognition site in size 
and shape, in addition, there are five specific binding areas within the receptor model: 
hydrophobic A and B, ionic C and D and cooperative hydrogen bonding E. The NGF/TrkA 
stereochemical compatibility in areas A. C and D is required for TrkA receptor activation, 
whereas areas B and E reinforce binding specificity, making the NGF molecule a very 
sophisticated key triggering the chain of biological events following receptor phosphorylation. 
The 3-dimensional structures of the binding domains of the fully active NGF analogues are 
"preorganized" for stereochemical fit to the second leucine-rich motif of TrkA. Five distinct 
binding areas F-K were also identified in the BDNF/TrkB complex, although, in contrast with 
NGF/TrkA binding, BDNF/TrkB recognition is based primarily on ionic and hydrogen bonding 
interactions. In spite of significant differences in the nature of ligand-receptor interactions 
involved in TrkA and TrkB recognition processes, binding patterns of their receptor binding 
sites for NGF and BDNF are almost identical. The predicted mechanism of high affinity 
NGF/TrkA and BDNF/TrkB binding is consistent with the conformational model of TrkA-p75 NTR 
interactions (Chao and Hempstead, 1 995). 

PART C 

THEORETICAL MODELLING OF THE 975™ RECEPTOR RECOGNITION SITE FOR 

NEUROTROPHINS 

INTRODUCTION 

As summarized above, the neurotrophins represent a family of structurally and 
functionally related proteins which play a crucial role in the development, survival and 
maintenance of the sympathetic and sensory neurons (Purves. 1988; Snider and Johnson. 
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1989; Barde. 1989; Thoenen, 1991; Korsching, 1993; Persson and Ibanez. 1993; Davies, 
1994; Ibanez. 1995). The neurotrophin family consists of nerve growth factor (NGF), brain- 
derived neurotrophic factor (BDNF). neurotrophin-3 (NT-3), neurotrophin-4/5 (NT-4/5), and 
more recently discovered neurotrophin-6 (NT-6) (Berkmeier et a/., 1991; Barde, 1991; 
Barbacid, 1993; Bradshaw era/.. 1994; Chao. 1994; Gotz et a/.. 1994). Each neurotrophin is 
capable of binding independently with the common neurotrophin receptor p7S" r «, as opposed 
to specific Trk receptor interactions. 
METHODS 

The geometric features of complexes of monomelic neurotrophins with the p7S™ 
receptor binding site have been obtained by multiple molecular dynamics runs of 200-300 ps 
at constant temperatures between 300 and 400°K. followed by local energy refinements. The 
QUANTA V4.1 molecular modelling program (Molecular Simulations Inc.. Waltham, MA) with 
the united atom CHARMM potential energy terms (Brooks et a/.. 1983) has been used. 
Residues of p75 NTR are designated by abbreviation p. 

According to the invention, we superimposed the binding domains of NGF. BDNF 
NT-3 and NT-4/5 with a putative binding site of P 7S™ and carried out multiple molecular 
dynamics computations of the ligand-receptor complexes. These calculations revealed the 3- 
dimensional structures of the binding domains of the neurotrophins and p75™, demonstrated 
their geometric and electrostatic complementarity, and identified principal residues involved in 
receptor recognition. All neurotrophins are believed to induce similar conformational changes 
in the p7S m * binding site, generally located in the orientation of the sequential loops within 
the cysteine-rich repeats. 

Initial geometries of individual components of the complexes were based upon the X- 
ray crystallographic coordinates of neurotrophins NGF (McDonald et a/.. 1991), BDNF and 
NT-3 (Robinson era/.. 1995) and the p55 TNFR receptor extracellular domain (Banner et al. . 
1 993) which is homologous to p75™. Geometry of the NT-4/5 monomer was constructed 
from the X-ray derived structure of NT-3 (Robinson et a/., 1995). Since NGF and other 
neurotrophins fold in similar rigid conformations, their molecular motions were restricted by 
imposing the following atom constraints. First, peptide backbones of the neurotrophins were 
fixed at their crystallographic coordinates. Further, conformations of the amino acid side 
chains of the neurotrophins were also fixed except for those located within and in the vicinity 
of the variable loop regions I and V (the p75™ binding determinant). Thus, amino acid side- 
chain motions were allowed for residues 23-35 and 93-98 (NGF numbering) as well as for 
eight additional neighboring residues, namely His 84 (Gin in other neurotrophins). Phe 65 (Tyr in 
other neurotrophins). Lys 68 (Arg in other neurotrophins). Trp* 9 , Arg 100 . Phe 101 (Trp in NT-3 and 
NT-4/5). Arg' 03 . and Asp 105 . The flexible amino and carboxyl termini of the neurotrophins were 
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excluded to avoid artefactual spacial hindrances for the receptor binding. 

Fig. 16a represents a fragment of the crystallographically resolved structure of the 
p55 TNFR extracellular domain (Banner et a/., 1993), corresponding to the second CRD (loops 
2A and 2B) and the first loop of the third CRD (loop 3A). Fig. 17 illustrates the sequence of 
5 this fragment and its alignment with the three known p75 NTR peptides (Johnson et a/., 1986; 

Radeke et a/., 1987; Large et at., 1989). Fig. 16b presents the minimum energy conformation 
of the corresponding fragment of human p75 NTR . As is apparent, its size and shape are 
complementary to the exposed p75 NTR binding determinant of NGF (Fig. 16c). Accordingly, 
this 5B-amino acid fragment of p75 NTR ( including the second and the beginning of the third 

10 CRDs (Fig. 17), has been considered as a putative binding site for neurotrophins. The 

artefactual charges on the terminal amino and carboxyl groups of the peptide representing the 
p75 NTR binding site were neglected. 

Once the most stable equilibrium 3-dimensional structures of the complexes being 
studied were obtained, the residues contributing the most to the energy of intermolecular 

1 5 interactions were identified Point atomic charges of Weiner et al. (1984) and a dielectric 
constant equal to the interatomic separation in angstroms (to implicitly simulate solvation 
effects) (Weiner etal. t 1984) were utilized to calculate the electrostatic intermolecular terms. 
The van der Waals intermolecular terms were evaluated by the united atom CHARMM force 
field (Brooks et a/., 1983). The calculations were performed on a 200 MHz lndigo-2 

20 SiliconGraphics Workstation. 

RESULTS AND DISCUSSION 

The most stable structures of the complexes of the neurotrophins with the putative 
p75 NTR receptor binding site obtained by the molecular dynamics computations described 
herein are illustrated in Fig. 18. Table 9 presents net values of the electrostatic and van der 

25 Waals terms of the ligand-receptor interaction energy. Tables 9 and 10 list individual 

residues of the neurotrophins and the p75 NTR receptor which significantly contribute to the 
intermolecular electrostatic interaction energy. The highlighted residues represent the 
functional epitopes of the ligands (Table 9) and the receptor (Table 10), whereas the other 
listed residues, although being located within the binding interfaces, apparently do not 

30 participate in major ligand-receptor electrostatic interactions in the particular complex. 

Predicted Cartesian 3-dimensional coordinates of atoms of the p75 NTR binding sites in the 
complexes with NGF and BDNF are presented in Appendix 3. 

Significant contributions of electrostatic interactions to the ligand-receptor interaction 
energy (Table 9) result from opposite net charges of the binding domains of neurotrophins 

35 (positively charged) and the common neurotrophin receptor (negatively charged), which effect 
is consistent with published predictions (Ryden et a/., 1995). Data presented in Tables 9 and 
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10 demonstrate that, on the average, the interaction between the functional epitopes is 
responsible for 94% of intermodular electrostatic energy, but only 43% of van der Waals 
interaction energy. Consequently, residues participating in strong and specific ligand-receptor 
.nteract,ons represent only a part of a bigger binding interface in which significant van der 
5 Waals contribution is accumulated from a large number of weak contacts, which is possible 
only as a consequence of geometric complementarity of the binding domains Fig 19 
presents the structure of the BDNF/p75 NTR complex as atomic van der Waals spheres and 
demonstrates the geometric fit of the binding domains. The van der Waals energy and 
therefore, geometric complementarity of the binding interfaces in the NGF/p75 NTR complex is 
1 0 significantly lower than in the other neurotrophics™ complexes (Table 9) which is 

consistent with its fastest rate of dissociation (Ibanez, 1995). However, the pure geometric 
factor cannot explain the slowest rate of dissociation observed for the BDNF/p75 NTR complex 
(Ibanez. 1995). Thus, stability of the complex arises from a combination of geometric and 
electrostatic complementarities of the binding interfaces, consistent with the generalizations 
of Clackson and Wells (1995). A specific feature of the neurotrophics™ binding 
interfaces is that they do not include substantially hydrophobic residues, so they do not have 
a hydrophobic core, which has been found in many other protein-protein complexes 
(Clackson and Wells. 1995; Livnah eta,., 1996). The predominant interactions within the 
binding domains of the complexes under study are of the salt-bridge type. 

Specificity of binding of the neurotrophins to the P 75 NTR receptor is determined by 
charged residues within the binding domains (Ryden era/., 1995). As seen in Fig 18 they 
create 3-dimensional networks in which most of them are in contact with more than one 
res,due of the opposite charge. Therefore, any missing charge would destroy more than just 
one bond, but a significant portion of. if not the whole, network. That is, the electrostatic 
25 complementarity of the binding interfaces would be altered, which is consistent with the 

experimentally observed sensitivity of binding to mutations of charged residues (Ibanez et al. 
1992; Ryden era/., 1995; Ryden and Ibanez, 1996). Both ligand and receptor provide the 
network with positive and negative charges (Table 9 and 10, Fig. 18), even though negative 
charges dominate in the receptor, and positive in the ligands. This particular spatial 
arrangement of the charges increases specificity of binding and prevents electrostatic 
repulsions within the same binding domains. For example, in all the complexes positively 
charged residue Lys*" of the receptor forms salt-bridges with negatively charged residues 
Asp'* and Asp** of the receptor, thereby occupying one site of their coordination such that 
only the opposite sides of the tetter residues remain available for intermolecular ionic 
interactions (Fig. 18). Negatively charged residue Asp 93 of BDNF holds together positively 
charged residues Arg* and Arg™ of BDNF, such that they are both able to interact with 
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juxtaposed amino acids Asp"" and Asp 7 *> of the receptor (Fig. 18b). Similarly, negatively 
charged residue Asp 30 of the neurotrophic forms a salt-bridge interaction with Arg'°°. which 
directly interacts with p75 NTR . 

In spite of the differences in the binding interfaces of the respective complexes 
binding patterns of the p75™ receptor are similar. All three loops of the receptor binding site 
(2A, 2B and 3A) are involved in binding (Fig. 18). The principal residues of loop 2A are Asp<"> 
and Lys**; those of loop 2B are Asp 7 * and Asp**; and those of loop 3A are Asp** and Glu«» 
As is seen in Table 10, loop 2B forms the strongest contacts. In addition to the principal 
binding pattern, the p75™ binding domain comprises specific residues for ionic interactions 
with particular neurotrophic (Table 10. Fig. 18). Thus. Arg«» interacts only with Glu 36 of 
NGF; residue Glu 5 * interacts only with Lys* of BDNF, and Glu«» and Glu 7 *> interact only with 
residues of NT-3 and NT-4/5. 

Site-directed mutagenesis studies of the P 75™ binding site have suggested that 

, „,.„ v , U1C IBS1 uu es Msp — , Asp— or Glu^ to Ala does not affect NGF 

binding, whereas mutation of Ser*». which does not participate in major ionic interactions, 
results in the complete abolishment of binding (Baldwin and Shooter. 1995). The apparent 
discrepancy with the present theoretical studies can be explained, since the cited 
experimental observations are still controversial. Indeed, although both Asp 75 ' and Asp 76p are 
involved in ionic interactions in the loop 2B region with positively charged residues of 
neurotrophins. one of them might be enough to maintain binding to NGF. Likewise, the 
functional role of the missing residue Glu"* in loop 3A might be fulfilled by adjacent Asp 88 " 
Residue Ser 5 * of p75 MTR is predicted to be hydrogen bonded to Gin 96 of NGF (Fig. 18a). and 
its mutation to Asn. Ala or Thr. which abolishes receptor binding (Baldwin and Shooter. 
1995). may create steric hindrances in the NGF/p75 NTR interface. The present molecular 
modelling studies suggest that Ser*» is not important for binding of any other neurotrophin 
(Fig. 18, b-d). 

In contrast to the P 75 NT * receptor binding pattern, there are only two common 
residues of the binding interfaces of neurotrophins which directly interact with the receptor 
site, Arg™ and Arg 103 . Contribution of Arg 1 " is significant in all the complexes, while Arg 100 
30 seems to be less important for the NGF/p75 NTR binding. 

As is seen in Table 9, most important residues of the NGF binding domain are (in 
order of decreasing significance) Lys 32 , Lys*. Arg 103 , Lys 95 , and His", all of them forming 
specific ionic contacts with negatively charged residues of p75 NTR (Fig. 18a). Involvement of 
Lys 32 . Lys 34 , and Lys 95 in p75 NTR binding has been very well established experimentally, with 
the same order of significance being observed (Ibanez, 1994). The most important residues 
of NGF. Lys 32 and Lys* bind to the loop 2B residues of p75 NTR (Fig. 18a). Here we 
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demonstrate that residue Lys 88 is not critical for p75 mR binding, whereas residues Arg 103 and 
His w make direct ionic contacts with the receptor binding site. Importance of the latter 
residue has not been previously reported in the context of p75 NTR binding. The roles of Asp 30 , 
Glu 35 and Arg 100 seem to be secondary, since their contributions are relatively small. 
5 Hydrophobic residue lie 31 is not directly involved in ligand-receptor interactions. 

In BDNF, all three positively charged residues of variable region V, Lys 95 , Lys 96 and 
Arg 97 (Ryden et a/., 1995), bind to negatively charged residues of the p75 NTR binding site, 
namely Asp 47p , Glu 53p and Asp 75p , respectively (Fig. 18b). In addition, the residues Arg 88 and 
Arg 101 play a more important role than in the NGF case (Table 9); they bind to the loop 2B 

10 residues Asp 75p and Asp 76p of pTS" 1 *, thereby fulfilling the function of missing positively 

charged residues of loop I. The increased importance of residue Arg 88 in BDNF and other 
neurotrophins relative to that of Lys 88 in NGF (Table 9) is permitted by a longer side chain of 
Arg, which results in a decrease of the separation between opposite charges (Fig. 18). 
Because of missing residue His 84 , only the positively charged residue Arg 104 binds to the loop 

1 5 3A residues Asp** and Glu 8915 of p7S mR . 

In NT-3, the loop 2B residues of the receptor Glu 73p , Asp 75p , and Asp 76p bind to Arg 31 , 
His 33 , Arg 100 , and Arg 87 . Residue Asp 47p of loop 2A of the receptor also binds to Arg 100 . 
Residue Lys 95 of variable region V does not form any salt-bridge contact with the receptor, 
consistent with its reported insignificance for NT-3 binding (Ryden et a/., 1995). Thus, the 

20 present results agree well with the established roles of Arg 31 and His 33 of NT-3 (Ryden et ai , 
1995; Ryden and Ibanez, 1996) and predict the increased importance of residue Arg 100 in NT- 
3 for p75 NTR binding (Table 9). The binding pattern of NT-4/5 is similar to that of NT-3 (Fig. 
18, c.d). The increased electrostatic contribution of Arg 36 with respect to corresponding NT-3 
residue His 33 (Table 9) is caused by the longer side chain of the former, which results in 

25 formation of a salt-bridge contact with residue Glu 80 * 5 of the receptor (Fig. 18d). 

Binding of a neurotrophin induces conformational changes in the p75 mR binding site. 
The conformational space of the three loops (2A, 2B and 3A) is limited by eight disulfide 
bonds; therefore, the most significant changes occur in the junction areas of those loops. 
Fig. 20 illustrates the superimposed equilibrium geometries of the free and the BDNF- 

30 complexed pTS"™ binding site. The superimposition is performed by a least-square 

minimization technique using the a-carbon reference atoms of loops 2A and 3A. As is 
apparent, the orientation of loop 2B with respect to the two flanking loops is considerably 
changed upon binding. Because of striking similarities between the binding patterns of the 
p75 NTR receptor with the different neurotrophins, a similar conformational change takes place 
. 35 in gill four complexes under investigation. The driving force behind this particular 

conformational transition is ionic interactions of the principal residues of loop 2B ( Asp 75p and 
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Asp 6p l with positively charged residues of the neurotrophic taking into account that spacia! 
positions of the flanking loops 2A and 3A are fixed by ionic interactions of residues Asp 47p , 
Lys** Asp** and Glu 890 with corresponding residues of the neurotrophins (Fig. 18). Thus, 
structural variability between the neurotrophins does not preclude them from inducing similar 
5 conformational change in the common neurotrophin receptor. 

The binding pattern of neurotrophins to the pTS*™ receptor predicted by the present 
molecular modelling studies can be additionally verified by checking conservation of the 
identified principal residues. The sequences of the binding site of the p75 mR receptor from 
different species (Fig. 17) demonstrate that principal residues forming its common binding 
10 pattern, namely Asp 47p , Lys**, Asp 75p , Asp 76p , Asp*, and Glu 890 , are conserved in p75 NTR . The 
known variability of regions I and V among neurotrophins (Ryden et a/., 1995) implies that 
conservation of the principal residues of the p75 m * binding determinants may be checked 
only within the same neurotrophin derived from different species. Study of a recent sequence 
alignment of neurotrophins (Bradshaw et a/., 1994) allowed us to conclude that the functional 
1 5 epitope of the neurotrophins (Table 9) is generally conserved within the same neurotrophin 
even though there are some exceptions, often represented by the substitution by a similar 
amino acid. Arg 100 and Arg 103 are the only two residues of the binding epitopes which are 
conserved throughout all neurotrophins (Bradshaw et a/., 1994). Other residues within the 
functional epitopes maintain specificity of binding of different neurotrophins. 
20 The present model also represents a key to understanding the conservation of 

residues which do not directly participate in receptor binding. For example, Gly 33 is absolutely 
conserved in all neurotrophins, yet is not involved in any specific interactions with the receptor 
binding site. However, within the present model, its conservation can be understood, since it 
is located very close to loop 3A of the receptor and forms van der Waals contacts with it (Fig. 
25 18), such that any other residue would impose steric hindrance which would alter or destroy 
the ligand-receptor geometric complementarity. 

The principal NGF/TrkA binding areas necessary to trigger TrkA phosphorylation and 
subsequent biological events have been discovered by the inventors. Specifically, the 
inventors have discovered that essentially the full binding site of the second leucine rich motif 
30 (LRM) of TrkA, amino acid residues 93 to 1 17, forms the binding site for NGF, and comprises 
five distinct binding areas, A, B, C, D and E. Appendix 1 provides the Cartesian coordinates 
of the LRM-2A residues defining the areas A to E in PDB format. Binding areas A and B bind 
hydrophobically to ligands, areas C and D form ionic bonds with charged portions on ligands 
and area E forms hydrogen bonds with ligands. 
35 The stereochemical fit to at least three of these binding sites, A, C and D by a ligand 

is necessary to activate TrkA receptor. Binding areas B and E reinforce stereoselectivity of 
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the TrkA binding site by selecting an NGF analogue with a stereochemical^ compatible amino 
terminus. 

Binding area A of the second LRM comprises hydrophobic interaction of residue 
Phe 105lA Binding area B comprises hydrophobic interaction of residues Phe 111lA , Phe 113lA and 
5 Thr 114tA . Binding area C comprises attractive ionic interaction between a suitable portion on 
the ligand and Asp 109tA . Binding area D comprises attractive ionic interaction between a 
suitable portion on the ligand and residue Lys 100lA . Binding area E comprises multiple parallel 
P-strand type hydrogen bonding with region Asn^-lle 98 * 

Main structural features of the bioactive conformation of NGF include: 1) ionic 

10 hydrogen bonding between Glu 11 in the amino terminus and Arg 118 * in the carboxyi terminus; 2) 
separation of the flexible loop 1-8 from the rigid region (9-11 and 112' to 118') because of 
electrostatic repulsions caused by His 4 and conformational restrictions imposed by Pro 5 ; 3) 
packing the side chain of Arg 9 or Met 9 in the "hollow" of the rigid region, this "hollow" being an 
indentation in the protein structure by residues Lys 115 '-Arg 116 ; and 4) an embryonic 3-sheet 

1 5 structure present between the a-cartidns of the 7th and 9th residues. 

Within NGF, residues Trp 7 *\ Arg 114 , Lys 115 ' and Asp 16 comprise the pharmacophore 
which binds TrkA. Any synthetic molecule, whether peptide or non-peptide, which fits into 
binding areas A, C and D is contemplated to elicit a biological response (agonist) or block 
biological response (antagonist). 

20 The present invention provides a ligand for binding with TrkA, wherein TrkA comprises 

a second leucine rich motif (LRM) region comprising amino acid residues 93 to 117, the 
coordinates of the residues of said LRM given in Appendix 1 , said LRM motif having five 
binding areas, area A comprising hydrophobic interaction of amino acid residue Phe 105tA , area 
B comprising hydrophobic interaction of amino acid residues Phe 111tA , Phe 113lA t and Thr 114tA , 

25 area C comprising ionic interaction of amino acid residues Asp 109tA and His 112tA , area D 

comprising ionic interaction of amino acid residue Lys 100lA , and area E comprising multiple 
parallel p-strand type hydrogen bonding of region Asn 95 * to He 98 *. The ligand comprises at 
least one functional group being capable of hydrophobic bonding and being present in an 
effective position in the ligand to hydrophobically bind to area A, at least one positively 

30 charged functional group being present in an effective position in the ligand to ionically bind to 
area C, and at least one negatively charged functional group being present in an effective 
position in the ligand to ionically bind to area D. 

Secondary features of the ligand include at least one effective functional group 
present in a position in the ligand to hydrophobically bind to area B of TrkA, and a functional 

35 group present in an effective position in the ligand to hydrogen bind to area E of TrkA. 

The invention further provides the ligand comprising amino acid residues, at least one 
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of the residues being capable of hydrophobic bonding and being present in an effective 
position in the ligand to hydrophobically bind to Phe 105tA in area A on the TrkA receptor, at 
least one of the residues being a positively charged residue present in an effective position in 
the ligand to ionically interact with Asp 109lA and His 112tA forming area C on the TrkA receptor, 
5 and one of the residues being a negatively charged residue present in an effective position in 
the ligand to ionically bind to Lys 100lA in area D on the TrkA receptor. 

The present invention also provides a method of designing a ligand to bind with an 
LRM binding site on TrkA. The method includes computationally evolving a chemical ligand 
using an effective genetic algorithm with preselected spatial constraints so that the evolved 

1 0 ligand comprises at least three effective functional groups of suitable identity and spatially 
located relative to each other in the ligand so that a first of the functional groups 
hydrophobically interacts with binding area A, a second of the functional groups ionically 
interacts with binding area C, and a third one of the functional groups ionically interacts with 
binding area D of the second LRM of TrkA. Those skilled in the art will be aware of the 

15 various techniques which may be used for de novo ligand design. For example, packages 
such as Quanta, Discover and Insight provide computational methodologies for generating 
ligands. 

Some ligands according to the present invention may be peptides or peptidomimetics. 
Morgan and co-workers (Morgan et a/., 1989) define peptide mimetics as "structures which 

20 serve as appropriate substitutes for peptides in interactions with receptors and enzymes". 
The mimetic must possess not only affinity but also efficacy and substrate function. For 
purposes of this disclosure, the terms "peptide mimetic" and "peptidomimetic" are used 
interchangeably according to the above excerpted definition. That is, a peptidomimetic 
exhibits function(s) of a particular peptide, without restriction of structure. Peptidomimetics of 

25 the invention, e.g., analogues of the neurotrophin and receptor binding sites described herein, 
may include amino acid residues or other chemical moieties which provide functional 
characteristics described herein. 

Known compounds may also be screened to determine if they satisfy the spatial and 
functional requirements for binding with the second LRM of TrkA. The molecules in the data 

30 bases are screened to determine if they contain effective moieties spaced relative to each 
other so that a first moiety hydrophobically interacts with binding area A, a second of moiety 
ionically interacts with binding area C, and a third moiety ionically interacts with binding area 
D of the second LRM of TrkA. 

The consents of all references and appendices (see Appendices 1, 2, 3, and 4) cited 

35 throughout this application are hereby expressly incorporated by reference. 
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EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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Table 1. Equilibrium variable torsional angles (in degrees) of the amino and carboxyl termini 
of the most stable conformations of the NGF dimers and analogues * 









Active structures 


| Inactive structures 


5 


No. 


hNGF 


mNGF 


mNGFD3-9 hNGF-H4D mNGFD1-8 BDNF 










Amino terminus 





11 




E 


82.9 


E 


85.9 


E 


89.2 


E 


80.9 


E 


63.7 


E 


68.2 


10 








55.7 




56.3 




57.8 




57.2 




47.9 




56.3 






xi 




-69.8 




-72.0 




-72.1 




-75.0 




-66.2 




-76.9 






X2 




72.7 




71.7 




66.0 




72.6 




71.2 




70.1 






X3 




-120.4 




-111.7 




-107.0 




-104.8 




-102.8 




-97.7 




10 


Hj 


G 


-89.1 


G 


-95.1 


G 


-88.0 


G 


-129.5 


G 


-81.6 


G 


-134.1 


15 




<t> 




-172.5 




-178.6 




-177.5 




104.7 




-80.8 




155.0 




9 


if 


R 


-36.1 


M 


-21.1 


S 


146.5 


R 


-46.5 


M 


-66.6 


R 


71.4 






<t> 




-92.8 




-91.4 




-103.6 




-80.9 




157.5 




-128.5 






X1 




-77.2 




-75.2 




-66.7 




-115.0 




-142.4 




-73.5 






X2 




-173.9 




-179.7 




64.0 




69.5 




-69.7 




-179.8 


20 




X3 




-175.5 




179.9 




- 




-141.9 




178.3 




-71.4 






X4 




134.3 




- 




- 




98.1 




- 




-109.8 




8 




H 


118.6 


H 


110.4 


s 


106.1 


H 


151.6 




- 


R 


105.2 






4> 




-116.5 




-110.4 




52.8 




-53.7 




- 




-100.1 






X1 




-89.7 




-90.9 




-45.9 




-62.0 








179.6 


25 




X2 




-73.8 




-71.5 




-172.3 




-72.5 








-105.5 






X3 
























162.1 






X4 
























133.7 




7 


U) 


F 


135.7 


F 


129.4 






F 


142.3 






A 


161.6 






<J> 




-68.8 




-89.0 








-114.8 








-166.6 


30 




X1 




-171.6 




-163.9 








-170.7 














X2 




-96.3 




-91.5 








-100.2 












6 




I 


40.3 


V 


61.7 






1 


40.0 






P 


114.4 






<J> 




40.4 




47.7 








39.9 














X1 




-159.7 




-177.3 








-159.7 










35 




X2 




87.1 












88.6 











54 



BNSDOCID: <WO 9806048A2J_> 



WO 98/06048 



PCT/CA97/00539 





5 




P 


156.0 


P 


149.8 


- 


P 


105.5 




- 


D 


82.8 






<t> 




- 




- 


- 




- 




- 




56.5 






X1 




- 




- 


- 




- 




- 




-176.9 






X2 




- 




- 


- 




- 




- 




-112.8 


5 


4 


if 


H 


121.4 


H 


122.1 


- 


D 


117.9 




- 


S 


136.2 






* 




-129.2 




-129.8 


- 




-102.8 




- 




-53.4 










-162.5 




-163.3 


- 




-66.4 




- 




-172.9 






X2 




-69.1 




-69.2 


- 




-97.5 




- 




-178.3 




3 




S 


2.1 


T 


-1.9 


- 


S 


-25.0 




- 


H 


153.8 


10 




4> 




-91.9 




-84.5 


- 




-68.2 




- 




48.2 






X1 




-171.8 




-174.9 


- 




39.3 




- 




82.1 






X2 




-53.2 




-51.2 


- 




-120.5 




- 




69.8 




2 


111 


s 


1.38.5 


S 


139.4 


- 


S 


38.5 




- 




- 










-128.9 




-114.1 


- 




50.8 




- 




- 


15 




X1 




173.3 




169.7 


- 




-173.2 




- 




- 






X2 




-141.9 




-139.8 






-120.6 








— 




1 




s 


133.0 


S 


127.9 




s 


131.9 














<t> 




48.6 




49.4 






-133.8 




- 




- 






X1 




-43.2 




-44.1 






-43.5 










20 




X2 




-171.5 




-172.4 






-173.6 




- 




- 
















Carboxyl terminus 














112' 


<t> 


L 


-113.3 


L 


-113.0 


L -112.4 


L 


-112.0 


L 


-111.5 


L 


-110.6 


25 








98.8 




98.7 


99.2 




97.7 




89.9 




100.1 






X1 




-72.3 




-71.6 


-71.6 




-71.7 




-71.8 




-72.1 






X2 




163.7 




163.0 


162.6 




162.2 




161.9 




161.1 




113' 


* 


S 


-83.4 


S 


-77.2 


S -77.4 


s 


-76.8 


S 


-91.9 


T 


-89.5 










136.1 




146.0 


143.5 




141.7 




149.4 




134.0 


30 




X1' 




177.0 




179.4 


176.3 




175.6 




169.6 




57.4 






X2 




65.2 




60.6 


59.0 




57.5 




73.2 




-179.7 




114' 


4> 


R 


-66.7 


R 


-71.5 


R -78.0 


R 


-75.1 


R 


-117.0 


I 


-57.9 










-47.0 




-53.7 


-59.3 




-57.1 




-31.9 




-32.2 






X1 




-99.1 




-145.6 


-93.4 




177.5 




-168.1 




-162.1 


35 




X2 




-173.1 




-179.8 


-171.5 




-159.7 




86.5 




87.9 






X3 




170.4 




-173.6 


-172.9 




58.1 




-73.8 
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10 



15 



20 



X4 




138.4 




140.3 




141.5 




124.7 




173.4 




- 


4> 


K 


-163.2 


K 


-162.1 


K 


-167.1 


K 


-163.0 


K 


-133.4 


K 


-167.7 






110.6 




132.0 




153.3 




157.6 




141.7 




126.1 


X1 




-139.8 




-143.3 




-140.1 




-137.8 




-164.0 




-142.2 


X2 




78.7 




81.7 




85.0 




79.3 




172.4 




83.5 


X3 




-96.4 




-97.9 




-95.8 




-97.4 




133.9 




-100.4 


X4 




129.5 




120.9 




131.0 




133.6 




-73.6 




109.2 


X5 




-39.1 




-35.6 




-47.7 




-44.1 




9.6 




-9.5 


* 


A 


-60.5 


A 


-69.6 


A 


-58.2 


A 


-63.9 


A 


-51.6 


R 


-61.4 






-47.5 




-40.7 




-47.5 




-42.4 




-45.3 




-39.3 


X1 




- 




- 




- 




- 




- 




-80.9 


X2 




- 




- 




- 




- 








-173.4 


X3 




- 




- 




- 




- 




- 




174.3 


X4 




- 




- 




- 




- 




- 




129.6 


4> 


V 


-71.8 


T 


-85.5 


T 


-64.6 


V 


-41.0 


T 


-114.6 


G 


-73.3 






112.0 




116.4 




108.2 




130.2 




168.4 




77.3 


X1 




-74.3 




53.4 




50.9 




-66.2 




57.2 




• 


X2 




- 




60.5 




58.0 




- 




-170.5 






* 


R 


-93.8 


R 


-118.2 


R 


-58.5 


R 


-125.2 


R 


-51.5 


R 


-49.0 






-57.9 




-48.4 




-37.6 




141.3 




-56.3 




-40.3 


X1- 




-104.3 




-80.9 




-109.5 




-126.2 




-153.8 




-125.9 


X2 




79.1 




109.1 




73.3 




106.7 




88.0 




98.0 


X3 




147.7 




116.1 




-76.3 




-78.8 




-74.9 




-76.6 


X4 




147.0 




116.1 




-87.2 




-98.5 




-125.9 




-111.5 



25 



•Torsional angles are designated according to the IUPAC-IUB rules (1970). 
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Table 2. Residues of BDNF and NGF defining the surfaces of their central P-strand stems, 
the side-chain thermal motion of which is taken into account when optimizing the 
structures of the complexes with the LRM-2B. 



BDNF 




NGF 




lie 16 


i 57 




Lys 57 


Glu 10 


Asn 


\ / tie 
Va! 


Arg 59 


Val 


Met 


vai 


Ser 


Thr 21 


Lys 65 


Val 22 




Asp 24 


Gin 79 


Asp 24 


Tyr 79 


Lys 25 


Arg 81 


Lys 25 


Thr 81 


Lys 26 


Thr* 2 


Thr 26 


Thr 82 


Thr 27 


Gin 84 


Thr 27 


His 64 


Val 29 


Tyr 86 


Thr 29 


Phe 86 


Tht 35 


Arg ,w 


Glu 35 


Arg 103 


Glu 55 


Asp 106 


Glu 55 


Asp 105 



20 

Table 3. Components of the molecular mechanical potential energy of the intermolecular 
NGF/LRM-2A interactions (in kcal/mol) 



25 Term hNGF mNGF mNGFA3-9 hNGF:NT-4/5 



E„ -61.2 -57.9 -39.3 -65.1 

Ee -96.2 -87.9 -99.0 -65.2 

-7.4 -5.9 -3.1 -4.6 

E M -0.3 -0.3 -0.1 -0.1 



Total -165.1 -152.0 -141.5 -135.1 



. 35 
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Table 4. Components of the molecular mechanical potential energy of the BDNF/LRM-2B 
and the NGF/LRM-2B complexes (in kcal/mol) 

A. Components of the potential energy 



10 



Ehb 

Et 



.E=~ 

— MM 



BDNF/LRM-2B 

89.3 
-183.8 
-31.0 
114.9 
0.4 



NGF/LRM-2B 
82.3 
-93.1 
-21.5 
114.0 
0.0 



Difference* 
-7.0 
90.7 
9.5 
-0.9 
-0.4 



15 



Total 



-10.2 



81.8 



92.0 



20 



25 



B. Components of the intermolecular interactions energy 

BDNF/LRM-2B NGF/LRM-2B Difference* 

Ew -48.0 -64-9 -16.9 

c .221.4 -144.2 2L2 



-5.6 -43 1.3 



Total -275.0 -213.5 61.5 



•Difference of the energy terms within the minimum-energy structures of the NGF/LRM-2B 
and the BDNF/LRM-2B complexes. Underlined values demonstrate that the difference in 
30 intermolecular electrostatic interactions dominates in the differences in potential energies of 
the complexes. 
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Table 5. Equilibrium variable torsional angles (in degrees) of the amino and carboxyl 

termini of the most stable conformations of the uncomplexed (UC) and complexed 
(C) NGF analogues.* 



5 


No. 






hNGF 




mNGF 






mNGFA3-9 










UC 


C 


UC 


C 




UC 


C 














Amino terminus 








10 


11 


V 


E 


82.9 


78.2 


E 85.9 


85.4 


E 


89.2 


96 6 






<P 




55.7 


60.3 


56.3 


56.3 




57 8 


60 






*1 




-69.8 


-69.3 


-72.0 


-69.4 




-72 1 


-7^ 7 






& 




72.7 


70.2 


71.7 


71.3 




66 0 


R7 n 






& 




-120.4 


-117.6 


-111.7 


-117.1 




-107.0 


-109 2 


15 


10 




G 


-89.1 


-122.1 


G -95.1 


-103.2 


G 


-88.0 


-94.8 






d> 




-172.5 


-166.5 


-178.6 


174.1 




-177.5 


168 7 




9 


V 


R 


-36.1 


-25.7 


M -21.1 


-14.3 


S 


146 5 

1 "TV?. «J 


lOD. I 






<t> 




-92.8 


-84.2 


-91.4 


-77.3 




1 VJO.O 


- 1 O^t. 1 






ri 




-77.2 


-92.9 


-75.2 


-84.9 






-1/51-1 


20 




jfi 




-173.9 


-174.1 


-179.7 


179.0 












jfi 




-175.5 


158.1 


179.9 


171.5 












A« 




134.3 


136.4 


- 


- 










8 


V 


H 


118.6 


98.3 


H 110.4 


96.1 


S 


I UD. I 


iDO.l 










-116.5 


-127.3 


-110.4 


-121.8 








25 








-89.7 


-108.0 


-90.9 


-104.1 




-45 Q 


-oo.o 










-73.8 


-87.9 


-71.5 


-65.8 




-172 3 


-174 1 




7 




F 


135.7 


113.6 


F 129.4 


106.7 












<p 




-68.8 


-122.4 


-89.0 


-130.8 












x\ 




-171.6 


-147.2 


-163.9 


-143.9 








30 




X2 




-96.3 


-85.8 


-91.5 


-87.5 










6 


V 


1 


40.3 


57.9 


V 61.7 


67.9 












<p 




40.4 


38.5 


47.7 


48.3 












*1 




-159.7 


-165.7 


-177.3 


-177.3 












X2 




87.1 


88.3 












35 


5 


V 


P 


156.0 


164.8 


P 149.8 


153.3 
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10 



15 



4 


If 


H 


121.4 


125.4 


H 


122.1 


122.9 




<fi 




-129.2 


-133.8 




-129.8 


-125.9 




& 




-162.5 


-163.2 




-163.3 


-161.8 








-69.1 


-71.9 




-69.2 


-69.9 


3 




S 


2.1 


-4.9 


T 


-1.9 


-5.1 




<p 




-91.9 


-78.9 




-84.5 


-82.3 








-171.8 


-172.3 




-174.9 


-171.4 




X2 




-53.2 


-51.2 




-51.2 


-52.3 


2 


If 


S 


138.5 


135.7 


S 


139.4 


139.4 




0 




-128.9 


-157.5 




-114.1 


-160.3 








173.3 


173.4 




169.7 


170.3 




A2 




-141.9 


-133.0 




-139.8 


-134.5 










wn .c 


.c 








if 


a 








1 ^ # . <if 






<P 




48.6 


46.6 




49.4 


53.8 




& 




-43.2 


-41.9 




-44.1 


-44.0 




X2 




-171.5 


-171.0 




-172.4 


-170.9 



Carboxyl terminus 



20 


112' 


4> 


L 


-113.3 


-113.1 






if 




98.8 


95.5 






xy 




-72.3 


-72.1 










163.7 


163.2 




113" 


<p 


S 


-83.4 


-72.5 


25 




If 




136.1 


147.9 






*1 




177.0 


179.1 






X2 




65.2 


56.0 




114" 


<t> 


R 


-66.7 


-101.6 






if 




-47.0 


-41.9 


30 




*1 




-99.1 


-175.3 






X2 




-173.1 


-130.0 






A3 




170.4 


159.9 






& 




H3S.4 


-117.4 




115' 


<P 


K 


-163.2 


-155.3 


35 




if 




110.6 


111.2 



L 


-113.0 


-114.1 


L 


-112.4 


-113.7 




98.7 


99.3 




99.2 


103.9 




-71.6 


-72.1 




-71.6 


-72.1 




163.0 


164.1 




162.6 


163.8 


S 


-77.2 


-70.5 


S 


-77.4 


-63.0 




146.0 


148.6 




143.5 


145.8 




179.4 


-177.9 




176.3 


-177.0 




60.6 


56.3 




59.0 


52.4 


R 


-71.5 


-92.5 


R 


-78.0 


-92.3 




-53.7 


-43.8 




-59.3 


-50.4 




-145.6 


-173.2 




-93.4 


-167.4 




-179.8 


-114.8 




-171.5 


-101.4 




-173.6 


151.4 




-172.9 


172.3 




140.3 


-■J 3D.® 




141.5 


o130.9 


K 


-162.1 


-152.2 


K 


-167.1 


-161.4 




132.0 


111.0 




153.3 


132.6 
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*1 




-139.8 


-150.7 




-143.3 


-156.2 




-140.1 


-144.6 








78.7 


-170.9 




81.7 


-170.0 




85.0 


-177.7 




& 




-96.4 


177.2 




-97.9 


-174.3 




-95.8 


-169.3 




X* 




129.5 


-165.4 




120.9 


-173.7 




131.0 


175.4 




AS 




-39.1 


-63.2 




-35.6 


-61.4 




-47.7 


-53.3 


116' 


0 


A 


-60.5 


-62.3 


A 


-69.6 


-61.7 


A 


-58.2 


-67.9 




V 




-47.5 


-43.9 




-40.7 


-47.8 




-47.5 


-40.4 


117' 


<P 


V 


-71.8 


-106.1 


T 


-85.5 


-106.0 


T 


-64.6 


-74.3 








112.0 


107.0 




116.4 


98.5 




108.2 


107.7 








-74.3 


-75.0 




53.4 


55.1 




50.9 


56.6 




& 










60.5 


63.2 




58.0 


54.0 


118* 


4> 


R 


-93.8 


-58.0 


R 


-118.2 


-58.4 


R 


-58.5 


-79.4 




*i> 




C7 O 


-CO "7 




AO A 

-TW.1 


CO 7 






a n .a 
— t^.w 








-104.3 


-88.5 




-80.9 


-84.7 




-109.5 


-135.3 




A2 




79.1 


74.4 




109.1 


79.7 




73.3 


108.0 




A3 




147.7 


161.4 




116.1 


147.5 




-76.3 


-70.1 




A* 




147.0 


119.7 




116.1 


118.7 




-87.2 


-135.3 



20 'Torsional angles are designated according to the IUPAC-IUB rules (1970). Torsional angles 
undergoing considerable changes upon LRM-2A binding are shown in bold. 
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Table 6. Optimized conformational variables (in degrees) of hNGF-complexed LRM-2A.* 



No Type 


4> 


if 




A2 


A3 A 1 * 


A5 


94tA Ara 


-161.6 


-62.5 


-169.2 


-164.9 


66.8 172.2 




95tA Asn 


-114.1 


132.0 


33.0 


-111.6 






96tA Leu 


-131.9 


114.2 


-161.2 


141.1 






97tA Thr 


-145.0 


125.0 


-179.9 


-176.6 






qntA lie 


-123.1 


21.6 


61.3 


105.7 






99tA Val 


-61.2 


107.3 


175.7 








100tA Lys 


54.4 


40.8 


-75.6 


161.7 


-174.8 166.2 


-41.3 


101tA Ser 


-97.4 


-40.0 


-176.4 


167.7 






102tA Gly 


144.4 


46.5 










103tA Leu 


-135.6 


155.5 


-85.0 


90.0 






1 04tA Ara 


-81.7 


179.3 


-33.6 


165.0 


-52.8 -144.7 




Phe 
lUJin rue 


-68.6 


-40.8 


-159.4 


93.3 






1 \J\J \J\ V Hi 


-106.1 


94.3 


176.1 








107tA Ala 


-173.5 


-54.8 










108tA Pro 




-26.2 










109tA Asp 


-85.5 


64.7 


-116.1 


102.8 






110tA Ala 


-117.5 


-57.2 










111tA Phe 


-79.4 


135.3 


38.4 


89.8 






11 9* A Hid 


-132.1 


78.5 


-67.1 


89.9 






113tA Phe 


-120.8 


-29.0 


54.6 


95.4 






114tA Thr 


-59.9 


-56.0 


-23.4 


-176.5 






115tA Pro 




-47.9 










116tA Arg 


-150.8 


142.7 


46.8 


98.5 


-100.4 155.7 




117tA Leu 


-76.3 


-39.4 


57.0 


117.9 







30 Torsional angles are designated according to the IUPAC-IUB rules (1970). 
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Table 7. Optimized conformational variables (in degrees) of BDNF-complexed LRM- 

2B* 



No. 


Type 


<P 


w 




Jk2 


*3 




AS 


94tB 


Arg 


mo 


- £ >4 7 


-69.7 


169.4 


-171.6 


119.4 




95tB 


Asn 




1^1 n 


-129 4 


-109.4 








96tB 


Leu 


P1 Q 

-0 1 .y 


UU. +J 


-92.9 


-58.2 








97tB 


Thr 


-138.3 


143. 0 


177 Q 


1R0 fi 








98tB 


tie 




1 Ifi A 




86.9 








99tB 


Val 


-1U/ .0 


lCj Q 

I O 1 .O 


-70 1 










100tB 


Asp 


"DO.o 


-72 8 


23.6 


95.2 








1 01 tB 


Ser 






-156 1 


157.6 








102tB 


Gly 


40 n 

**4.U 


44 4 












103tB 


Leu 






49.9 


87.1 








104tB 


Lys 




fiR 5 


-147 2 


173.5 


91.3 


-169.5 


45.1 


105tB 


Phe 


/ 0. / 


-7R 


-160.5 


-111.0 








106tB 


Val 


52. 0 


1Q Q 


-174 2 










107tB 


Ala 


Q 


OR 4 
4D.H 












A AQlD 


Tyr 


*%R 7 
-00. / 


X.W. W 


-46.5 


102.7 


-74.2 






109tB 


Lys 


71 4 


-27 6 


62 1 


-113.2 


176.6 


176.6 


-55.9 


A A f\l D 

110tB 


Aia 


OH 7 


-^2 6 












111tB 


Phe 


-63.5 


-37.4 


-168.3 


83.6 








112tB 


Leu 


-50.3 


-*f»3.4 


-1RQ 7 


91 0 








113tB 


Lys 


-61.5 


-48.0 


-78.9 


152.7 


-167.9 


63.2 


22.9 


114tB 


Asn 


-59.4 


-36.6 


-70.9 


-107.4 








115tB 


Ser 


-68.6 


-32.9 


-60.0 


68.6 








116tB 


Asn 


-80.7 


-45.4 


-83.4 


-96.8 








117tB 


Leu 


-127.8 


-58.8 


-75.6 


105.5 









•Torsional angles are designated according to the IUPAC-IUB rules (1970). 
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Table 8. 



Contributions of individual residues to intermolecular electrostatic interaction 
energies of the complexes BDNF/LRM-2B and NGF/LRM-2B (in kcal/mol)*. 
A. Binding epitopes of neurotrophic 



Total 



BDNF/LRM-2B 


NGF/LRM-2B 


Dine re 


Glu 18 


-61.6 


val 


u.o 


bi.y 


Asp 24 


-65.1 


Asp'' 


-44.9 


20.2 


Lys 25 


25.3 


Lys25 


15. 0 


-y.o 


Gly 33 


-11.9 


Gly 33 


-13.4 


-1 5 


Thr 35 


-11.4 


Glu 35 


-50.5 


-39.1 


Glu 55 


-50.2 


Glu 55 


-18.9 


31.3 


Lys 57 


27.1 


Lys 57 


22.5 


-4.6 


Arg 81 


-32.2 


Thr 61 


0.1 


32.3 


Ulll 


1.2 


His 84 


11.5 


10.3 


Arg 104 


-14.3 


Arg 103 


-9.5 


4.8 


Asp 106 


-39.7 


Asp 105 


-46.4 


-6.7 




-232.8 




-133.7 


99.1 



20 



25 



30 



BDNF/LRM-2B 
Arg 94 ® -40.9 

Ash 95 " 5 0.2 



Asp 



100CB 



Ser 



101 tfi 



-42.1 
-17.9 



Gly 102tB -1.3 
Lys 10 * 6 -18.5 
Lys 10>,B -46.4 
Lys 11 *® -54.6 



B. Binding epitopes of the LRM-2B 



NGF/LRM-2B 

4.2 
-12.8 

-9.3 

-0.1 
-12.0 
-48.1 
-21.5 
-37.2 



Difference*. 
45.1 
-13.0 
32.8 
17.8 
-10.7 
-29.6 
24.9 
17.4 



35 



Total -221.5 -136.8 84.7 

•The ±10 kcal/mol cutoff of intermolecular electrostatic interaction energy has been used to 
identify binding epitopes within the complexes. ^Difference of the intermolecular electrostatic 
interaction energies of the given residue within the minimum-energy structures of the 
MGF/LRM-2B and the BDNF/LRM-2B complexes. Residues within the binding epitopes of 
BDNF and the LRWI-2B which cause their binding specificity are bolded. 
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Table 9. Contributions of individual residues of the neurotrophins to the intermolecular 
interaction energy of their complexes with the common neurotrophin receptor p75 m ^ 
(kcal/mol)* 











BDNF 






NT- 3 






NT-4/5 








Res. 


EL 


VDW 


Res. 


EL 


VDW 


Res. 


EL 


VDW 


Res. 


EL 


VDW 




D-30 


17.9 


-0.9 


D-30 


25.0 


-1.8 


D-29 


24.6 


-2.6 


D-32 


19.8 


-1.8 




K-32 


-94.9 


-6.0 


S-32 


0.5 


-4.5 


R-31 


-48.2 


-8.0 


R-34 


-42.6 


-9.3 


10 


K-34 


-88.8 


-3.3 


G-34 


-1.8 


-3.1 


H-33 


-35.0 


-7.1 


R-36 


-57.2 


-7.7 




E-35 


-10.0 


-2.2 


T-35 


1.7 


-1.4 


Q-34 


0.5 


-2.3 


D-37 


-0.8 


-1.7 




H-84 


-30.9 


-1.0 


Q-84 


1.19 


-2.4 


Q-83 


0.8 


-0.7 


Q-83 


0.6 


-1.0 




r— ,or 

r-ou 




n *a 




_n «; 


-0.5 


Y-85 


-13.4 


0.3 


Y-96 


-9.2 


-0.3 




K-88 


-8.5 


-0.1 


R-88 


-12.7 


-0.2 


R-87 


-20.0 


-0.3 


R-98 


-16.2 


-0.2 


1 5 


D-93 


-9.0 


-1.3 


D-93 


13.7 


-2.2 


E-92 


24.1 


-2.8 


D-103 


9.0 


-2.1 




K-95 


-63.7 


-5.6 


K-95 


-82.9 


-7.6 


N-94 


-0.1 


-0.2 


Q-105 


-0.3 


-0.2 




Q-96 


-0.3 


-3.7 


K-96 


-58.2 


2.0 


K-95 


-49.6 


-2.6 


G-106 


-0.6 


-0.6 




A-97 


0.2 


-0.2 


R-97 


-79.3 


-5.7 


L-96 


-0.5 


-6.7 


R-107 


-28.0 


-11.2 




K-iOO 


-1 l.o 


-u. / 


p.ini 


-59.2 


-1.1 


R-100 


-79.3 


1.7 


R-111 


-57.8 


-1.9 


20 


R-103 


-64.5 


0.0 


R-104 


-64.2 


-1.6 


R-103 


-51.5 


0.6 


R-114 


-55.3 


-2.9 




D-105 


10.6 


-0.2 


D-106 


8.3 


-0.2 


D-105 


5.5 


-0.1 


D-116 


4.8 


-0.1 




Total 


-336.1 


-15.6 




-317.8 


-18.2 




-248.3 


-15.6 




-237.3 


-35.0 






100% 


55% 




99% 


39% 




97% 


32% 




102% 


61% 


25 


Net 


-337.3 


-32.3 




-321.4 -46.7 




-254.8 -48.9 




-233.0 


-57.5 



*EL and VDW denote electrostatic and van der Waais terms of the ligand-receptor interaction 
energy, respectively. The ±10 kcal/mol cutoff of the intermolecular electrostatic interaction 
30 energy has been used to identify the functional epitopes (bolded). TotaT electrostatic and 

van der Waals contributions of the functional epitopes to the "net" values of the intermolecular 
interaction energy calculated over the whole binding interface are presented. Residue 
numbering corresponds to the general alignment of neurotrophins (Bradshaw et at. % 1994). 

* 35 

65 



BNSDOCID: <WO 9806048 A2_l_> 



PCT/CA97/00539 

WO 98/06048 

Table 10. Contributions of individual residues of p75 NTR to the intermodular interaction 
energy of their complexes with neurotrophins (kcal/moir. 





NGF 






BDNF 






NT-3 






NT-4/5 






5 


Res. 


EL 


VDW 


Res. 


EL 


VDW 


Res. 


EL 


VDW 


Res. 


EL 


VDW 




D-47 


-46.0 


c\ o 
u.z 


n_47 


-67.1 


0.1 


D-47 


-50.5 


-0.3 


D-47 


-26.7 


-4.1 




E-53 


-2.6 


-0.2 


E-53 


-55.4 


1.6 


E-53 


-6.6 


-0.2 


E-53 


-3.6 


-0.1 




P-54 


-12.4 


-2.8 


P-54 


-13.5 


-2.5 


P-54 


-15.1 


0.2 


P-54 


-5.3 


-0.8 


10 


C-55 


-1.4 


-0.9 


C-55 


-8.4 


-1.3 


C-55 


-14.9 


-1.0 


C-55 


-3.0 


-1.2 




K-56 


21.8 


-4.5 


K-56 


13.2 


-4.8 


K-56 


32.9 


-5.0 


K-56 


14.8 


-5.8 




E-60 


-6.2 


-0.3 


E-60 


-3.0 


-0.1 


E-60 


-10.0 


-0.2 


E-60 


-10.4 


-0.4 








r\ r\ 




-9.7 


-0.2 


E-73 


-12.5 


-0.3 


E-73 


-11.2 


-0.3 








l_ f w 














D-75 


-56.5 


0.6 


D-75 


-77.1 


0.4 


D-75 


-65.5 


-1.1 


D-75 


-59.4 


-2.7 


15 


D-76 


-91.6 


-2.0 


D-76 


-22.3 


-8.4 


D-76 


-39.9 


-3.7 


D-76 


-39.3 


-5.1 




R-80 


-18.5 


-3.9 


R-80 


2.9 


-2.1 


R-80 


5.3 


-4.0 


R-80 


3.1 


-4.3 




D-88 


-49.1 


0.4 


D-88 


-39.8 


-0.4 


D-88 


-34.4 


0.4 


D-88 


-33.1 


-1.2 




E-89 


-32.5 


-1.6 


E-89 


-25.2 


-6.2 


E-89 


-33.8 


-3.8 


E-89 


-33.0 


-5.9 


20 


Total 


-285.4 


-13.6 




-287.2 


-20.2 




-243.7 


-14.8 




-198.3 


-25.5 






84% 


42% 




89% 


43% 




96% 


30% 




85% 


44% 



•See footnote to Table 9. 
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Appendix 1. Geometry of the LRM-2A in PDB format.* 



REMARK 1 The bioactive conformation of the second LRM of TrkA 
REMARK 2 HSC stands for charged HIS 



w? 


REMARK 


3 






ATOM 


1 N LEU S 93 

1 111 Uks V W WW 


19.399-23.034 -15.306 1.00-0.45 




ATOM 


7 OA LEU S 93 


18.757-22.140-14.332 1.00-0.07 






3 C LEU S 93 


19.555-20.869-14.130 1.00 0.53 




ATOM 

^ 1 V./I VI 


4 O LEU S 93 


20.777 -20.864 -14.044 1.00 -0.50 


10 


ATOM 


5 CB LEU S 93 


18.635-22.886-13.002 1.00 0.02 




ATOM 


6 CG LEU S 93 


17.661 -22.206-12.037 1.00 0.05 




ATOM 


7 CD1 LEU S 93 


16.251 -22.083-12.617 1.00-0.01 






o rno i .ci i o oi 


17.517 -22.968 -10.71 8 1 .00 -0.01 




ATOM 


9 1H LEU S 93 


19.651 -23.930-14.841 1.00 0.15 




ATOM 


10 2H LEU S 93 


18.742-23.223-16.089 1.00 0.15 




ATOM 


11 3H LEU S 93 


20.260-22.582-15.677 1.00 0.15 


4 


ATOM 


12 N ARG S 94 


18.551 -20.159-13.624 1.00-0.52 




ATOM 


13 CA ARG S 94 


18.777-19.044-12.708 1.00 0.24 




ATOM 


14 C ARG S 94 


17.460-18.800-12.010 1.00 0.53 


on 


ATOM 


15 O ARG S 94 


17.364-18.941 -10.781 1.00-0.50 




ATOM 


16 CB ARG S 94 


19.316-17.812-13.451 1.00 0.05 




ATOM 


17 CG ARG S 94 


19.818-16.700-12.515 1.00 0.06 




ATOM 


18 CD ARG S 94 


20.009-15.345-13.215 1.00 0.11 




ATOM 


19 NF ARG S 94 


18.720-14.803-13.642 1.00-0.49 




ATOM 


?f) P7 ARG S 94 


18.631 -13.550-14.138 1.00 0.81 




ATOM 

A\ 1 I VI 


21 NH1 ARG S 94 


19.703-12.767-14.256 1.00-0.63 




ATOM 


22 NH2 ARG S 94 


17.444-13.087-14.518 1.00-0.63 




ATOM 




17 611 -20 389-13 873 1 00 0 25 

1 f . w 1 1 \J . w W w 1 w • W f w * • ww w .fc w 




ATOM 


24 HE ARG S 94 


17.892-15.358-13.571 1.00 0.29 


30 


ATOM 


25 1HH1 ARCS 94 


19.604-11.843-14.627 1.00 0.36 




ATOM 


26 2HH1 ARG S 94 


20.606-13.094-13.977 1.00 0.36 




ATOM 


27 1HH2 ARG S 94 


17.363-12.161 -14.887 1.00 0.36 




ATOM 


28 2HH2 ARG S 94 


16.633-13.666-14.434 1.00 0.36 




ATOM 


29 N ASN S 95 


16.446-18.435-12.789 1.00-0.52 


35 


ATOM 


30 CA ASN S 95 


15.116-18.165-12.251 1.00 0.22 




ATOM 


31 C ASN S 95 


14.111 -19.178-12.745 1.00 0.53 
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ATOM 32 O ASNS 95 14.047-19.466-13.950 1.00-0.50 

ATOM 33 CB ASNS 95 14.629-16.742-12.566 1.00 0.00 

ATOM 34 CG ASNS 95 15.171 -16.300-13.911 1.00 0.68 

ATOM 35 OD1 ASN S 95 16.007 -15.410-13.997 1.00 -0.47 

5 ATOM 36 ND2 ASN S 95 14.657 -16.965 -14.959 1 .00 -0.87 

ATOM 37 H ASNS 95 16.591 -18.340-13.774 1.00 0.25 

ATOM 381HD2ASNS 95 14.948-16.750-15.892 1.00 0.34 

ATOM 39 2HD2 ASN S 95 13.976 -17.684 -14.820 1.00 0.34 

ATOM 40 N LEU S 96 13.325-19.720-11.819 1.00-0.52 

10 ATOM 41 CA LEUS 96 12.308-20.713-12.151 1.00 0.20 

ATOM 42 C LEUS 96 11.023-20.296-11.479 1.00 0.53 

ATOM 43 O LEUS 96 10.934-20.287 -10.242 1.00-0.50 

~~ *t-TC-r iOQ .1 1 7f)1 1.00 0.02 

ATOM 44 CB LtU is »o 

ATOM 45 CG LEUS 96 12.022-23.268-12.383 1.00 0.05 

15 ATOM 46 CD1 LEU S 96 12.921 -24.464-12.732 1.00-0.01 

ATOM 47 CD2 LEU S 96 10.797 -23.704 -1 1 .562 1 .00 -0.01 

ATOM 48 H LEUS 96 13.428-19.444-10.863 1.00 0.25 

ATOM 49NTHRS 97 10.026-19.951 -12.289 1.00-0.52 

ATOM 50 CA THR S 97 8.726 -19.526 -1 1 .778 1 .00 0.27 

20 ATOM 51 C THR S 97 7.604-19.988-12.678 1.00 0.53 

ATOM 52 O THR S 97 7.607-19.702-13.884 1.00-0.50 

ATOM 53 CB THR S 97 8.692 -18.000 -11.598 1.00 0.21 

ATOM 54 OG1 THR S 97 7.420 -17.594 -1 1 .097 1 .00 -0.55 

ATOM 55 CG2 THR S 97 9.014 -17.248 -12.899 1.00 0.01 

25 ATOM 56 H THR S 97 10.163-19.980-13.279 1.00 0.25 

ATOM 57 HG1 THR S 97 7.431 -16.635 -1 1 053 1 .00 0.31 

ATOM 58 N ILES 98 6.645-20.701 -12.095 1.00-0.52 

ATOM 59 CA ILE S 98 5.499-21.213-12.841 1.00 0.20 

ATOM 60 C ILES 98 4.166-20.765-12.292 1.00 0.53 

30 ATOM 61 O ILE S 98 3.134-21 406-12.544 1.00-0.50 

ATOM 62 CB ILES 98 5.633-22.735-13.006 1.00 0.03 

ATOM 63 CG1 ILES 98 5.655-23.480-11-662 1.00 0.02 

ATOM 64 CG2 ILE S 98 6.841 -23.109 -13.880 1.00 0.00 

ATOM 65 CD1 ILES 98 4.338-24.216-11.369 1.00 0.00 

35 ATOM 66HILES 98 6.703 -20.897 -11.116 1.00 0.2S 

ATOM 67NVALS 99 4.183-19.667-11.541 1.00-0.52 
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ATOM 68 CA VAL S 99 
ATOM 69 C VALS 99 
ATOM 70 O VALS 99 
ATOM 71 CB VALS 99 
ATOM 72 CG1 VAL S 99 
ATOM 73 CG2 VAL S 99 
ATOM 74 H VALS 99 
ATOM 75 N LYS S 100 
ATOM 76 CA LYS S 100 
ATOM 77 C LYS S 100 
ATOM 78 O LYS S 100 
ATOM 79 CB LYS S 100 
ATOM 80 CG LYS S 100 
ATOM 81 CD LYS S 100 
ATOM 82 CE LYS S 100 
ATOM 83 NZ LYS S 100 
ATOM 84 H LYS S 100 
ATOM 85 1HZ LYS S 100 
ATOM 86 2HZ LYS S 100 
ATOM 87 3HZ LYS S 100 
ATOM 88 N SER S 101 
ATOM 89 CA SER S 101 
ATOM 90 C SER S 101 
ATOM 91 O SER S 101 
ATOM 92 CB SER S 101 
ATOM 93 OG SER S 101 
ATOM 94 H SER S 101 
ATOM 95 HG SER S 101 
ATOM 96 N GLY S 102 
ATOM 97 CA GLY S 102 
ATOM 98 C GLY S 102 
ATOM 99 O GLY S 102 
ATOM 100 H GLY S 102 
ATOM 101 N LEU S 103 
ATOM 102 CA LEU S 103 
ATOM 103 C LEU S 103 



2.969-19.119-10.945 1.00 0.20 
1.976-18.727-12.013 1.00 0.53 
2.172-17.725-12.717 1.00-0.50 
3.389-17.945-10.045 1.00 0 03 
2.188-17.207 -9.432 1.00 0.01 
4.444-18.324 -8.993 1.00 0.01 
5.050-19.197-11.376 1.00 0.25 
0.911 -19.514-12.136 1.00-0.52 
-0.131 -19.257-13.126 1.00 0.23 
0.517-19.154-14.486 1.00 0.53 
0.146-18.291 -15.296 1.00-0.50 
-0.916-17.989-12.753 1.00 0.04 

-1.31 I - lO.I/O - I ' — — » 

-2.337-16.845-10.961 1.00 0.05 
-3.230-17.043 -9.726 1.00 0.22 
-3.328-15.780 -8.983 1.00-0.27 
0.814-20.308-11 536 1.00 0.25 
-2.395-15.321 -8.953 1.00 0.31 
-3.649-15.972 -8.012 1.00 0.31 
-4.008-15.150 -9.454 1.00 0.31 
1.485-20.030-14.739 1.00-0.52 
2.199-20.048-16.012 1.00 0.36 
1.652-21.075-16.974 1.00 0.53 
1.546-20.813-18.182 1.00-0.50 
3.695-20.297-15.758 1.00 0.19 
4.426-20.237-16.981 1.00-0.55 
1.736-20.701 -14.041 1.00 0.25 
5.354-20.203-16.738 1.00 0.31 
1 .302 -22.243 -16.444 1 .00 -0.52 
0.758-23.329-17.254 1.00 0.25 
1.148-24.732-16.854 1 00 0.53 
1.526-25.548-17.708 1.00-0.50 
1.413-22.390-15.461 1.00 0.25 
1.058-25.015-15.558 1.00-0.52 
1.402-26.331 -15.027 1.00 0.20 
0.326-26.726-14.045 1.00 0.53 
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ATOM 104 O LEU S 103 
ATOM 105 CB LEU S 103 
ATOM 106 CG LEU S 103 
ATOM 107 CD1 LEU S 103 
5 ATOM 108 CD2 LEU S 103 
ATOM 109 H LEU S 103 
ATOM 110 N ARG S 104 
ATOM 111 CA ARG S 104 
ATOM 112 C ARG S 104 
10 ATOM 113 O ARG S 104 
ATOM 114 CB ARG S 104 
ATOM 115 CG ARG S 104 
ATOM- 116 CD ARG S 104 
ATOM 117 NE ARG S 104 
15 ATOM 118 CZ ARG S 104 
ATOM 119 NH1 ARG S 104 
ATOM 120 NH2 ARG S 104 
ATOM 121 H ARG S 104 
ATOM 122 HE ARG S 104 
20 ATOM 123 1HH1 ARG S 104 
ATOM 124 2HH1 ARG S 104 
ATOM 125 1HH2 ARG S 104 
ATOM 126 2HH2 ARG S 104 
ATOM 127 N PHE S 105 
25 ATOM 128 CA PHE S 105 
ATOM 129 C PHE S 105 
ATOM 130 O PHE S 105 
ATOM 131 CB PHE S 105 
ATOM 132 CG PHE S 105 
30 ATOM 133 CD1 PHE S 105 
ATOM 134 CD2 PHE S 105 
ATOM 135 CE1 PHE S 105 
ATOM 136 CE2 PHE S 105 
ATOM 137 CZ PHE S 105 
35 ATOM 138 H PHE S 105 
ATOM 139 N VAL S 106 



PCT/CA97/00539 

-0.356-25.860-13.476 1.00-0.50 
2.792-26.289-14.372 1.00 0.02 
3.957-26.492-15.354 1.00 0.05 
4.502-25.183-15.949 1.00-0.01 
5.078-27.333-14.723 1.00-0.01 
0.745-24.311 -14.920 1.00 0.25 
0.170-28.031 -13.843 1.00-0.52 
-0.832-28.558-12.921 1.00 0.24 
-0.225-28.464-11.541 1.00 0.53 
0.920-28.016-11.387 1.00-0.50 
-1.253-29.984-13.311 1.00 0.05 
-1.250-30.233-14.827 1.00 0.06 
-1.308-31.721 -15.207 1.00 0.11 
-0.254-32.467-14.521 1.00-0.49 
0.365-33.502-15.129 1.00 0.81 
0.042 -33.880 -16.366 1 .00 -0.63 
1.319-34.159-14.478 1.00-0.63 
0.754-28.678-14.334 1.00 0.25 
0.018-32.215-13.592 1.00 0.29 
0.519-34.651 -16.789 1.00 0.36 
-0.672-33.401 -16.875 1.00 0.36 
1.786-34.927-14.915 1.00 0.36 
1.569-33.884-13.549 1.00 0.36 
-0.990-28.886-10.538 1.00-0.52 
-0.535-28.854 -9.152 1.00 0.21 
0.558-29.894 -9.090 1.00 0.53 
1.596-29.679 -8.447 1.00-0.50 
-1.656-29.230 -8.170 1.00 0.04 
-1.354-28.726 -6.778 1.00 0.01 
-1.851 -27.471 -6.366 1.00-0.01 
-0.575-29.526 -5.913 1.00-0.01 
-1.566-27.007 -5.067 1.00 0.00 
-0.290-29.063 -4.616 1.00 0.00 
-0.788 -27 MOB -4.206 1.00 0.00 
-1.906 -29.236 -10.732 1.00 0.25 
0.329-31.021 -9.758 1.00-0.52 
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ATOM 


140 


CA VAL S 106 


1.297-32.114 -9.787 1.00 0.20 




ATOM 


141 


C VAL S 106 


1.984-32.186-11.130 1.00 0.53 




ATOM 


142 


O VAL S 106 


1.469-32.815-12.067 1.00-0.50 




ATOM 


143 


CB VAL S 106 


0.545-33.403 -9.418 1.00 0.03 


5 


ATOM 


144 


CG1 VAL S 106 


1.433-34.654 -9.506 1.00 0.01 




ATOM 


145 


CG2 VALS 106 


-0.212-33.313 -8.084 1.00 0.01 




ATOM 


146 


H VAL S 106 


-0.530-31.131 -10.258 1.00 0.25 




ATOM 


147 


N ALA S 107 


3.145-31.545-11.226 1.00-0.52 




ATOM 


148 


CA ALA S 107 


3.920-31.529-12.463 1.00 0.22 


10 


ATOM 


149 


C ALA S 107 


5.237-30.854-12.163 1.00 0.53 




ATOM 


150 


O ALA S 107 


6.308-31.425-12.415 1.00-0.50 




ATOM 


151 


CB ALA S 107 


3.199-30.751 -13.576 1.00 0.03 




r~\ i wivi 


152 


H ALAS 107 


3.504-31.056-10.432 1.00 0.25 




ATOM 


153 


N PROS 108 


5.161 -29.640-11.625 1.00-0.26 


15 


ATOM 


154 


CA PRO S 108 


6.351 -28.871 -11.283 1.00 0.11 




ATOM 


155 


C PRO S 108 


7.264-29.638-10.357 1.00 0.53 




ATOM 


156 


O PRO S 108 


8.481 -29.403-10.336 1.00-0.50 




ATOM 


157 


CB PRO S 108 


5.782-27.574-10.686 1.00 0.00 




ATOM 


158 


CG PRO S 108 


4.356-27.900-10.256 1.00 0.04 


20 


ATOM 


159 


CD PRO S 108 


3.932-28.917-11.305 1.00 0.08 




ATOM 


160 


N ASP S 109 


6.680-30.556 -9.591 1.00-0.52 




ATOM 


161 


CA ASP S 109 


7.439-31.374 -8.649 1.00 0.25 




ATOM 


162 


C ASP S 109 


7.956-32.544 -9.451 1.00 0.53 




ATOM 


163 


O ASP S 109 


7.574-33.697 -9.200 1.00-0.50 


25 


ATOM 


164 


CB ASP S 109 


6.510-31.884 -7.536 1.00 -0.21 




ATOM 


165 


CG ASP S 109 


6.933-31.318 -6.195 1.00 0.62 




ATOM 


166 


OD1 ASP S 109 


6.290-30.391 -5.711 1.00-0.71 




ATOM 


167 


OD2 ASP S 109 


7.907-31.800 -5.626 1.00-0.71 




ATOM 


168 


H ASP S 109 


5.692 -30.697 -9.657 1.00 0.25 


30 


ATOM 


169 


N ALAS 110 


8.824-32.252-10.416 1.00-0.52 




ATOM 


170 


CA ALA S 110 


9.406-33.282-11.270 1.00 0.22 




ATOM 


171 


C ALA S 110 


10.898-33.275-11.034 1.00 0.53 




ATOM 


172 


O ALAS 110 


11.481 -34.304-10.662 1.00-0.50 




ATOM 


173 


CB ALA S 110 


9.132-33.009-12.758 1.00 0.03 


35 


ATOM 


174 


H ALAS 110 


9.090-31.300-10.566 1.00 0.25 




ATOM 


175 


N PHE S 111 


11.517-32.118-11.249 1.00-0.52 
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ATOM 176 CA PHE S 111 12.956-31.964-11.061 1.00 0.21 
ATOM 177 C PHE S 111 13.127-31.810 -9.568 1.00 0.53 

ATOM 178 O PHE S 111 12.376-31.064 -8.923 1.00-0.50 

ATOM 179 CB PHE S 111 13.497-30.712-11.770 1.00 0.04 

5 ATOM 180 CG PHE S 111 12.526-29.559-11.656 1.00 0.01 

ATOM 181 CD1PHES 111 12.618-28.673-10.561 1.00-0.01 

ATOM 182 CD2 PHE S 111 11.542-29.391 -12.654 1.00 -0.01 

ATOM 183 CE1 PHE S 111 11.710-27.601 -10.462 1.00 0.00 

ATOM 184 CE2PHES111 10.634-28.321 -12.556 1.00 0.00 

10 ATOM 185 CZ PHE S 111 10.727-27.436-11.461 1.00 0.00 

ATOM 166 H PHE S 111 10.987-31.324-11.549 1.00 0.25 

ATOM 187 N HSCS 112 14.112-32.515 -9.019 1.00-0.52 

ATOM 188 H HSCS112 i 4.681 -33.099 -9.597 1.00 0.25 
ATOM 189 CA HSCS 112 14.393-32.465 -7.587 1.00 0.19 
15 ATOM 190 CB HSCS 112 13.775-33.685 -6.885 1.00 0.21 
ATOM 191 CG HSCS 112 12.263-33.637 -6.920 1.00 0.10 
ATOM 192 CD2HSCS 112 11.396-33.062 -5.986 1.00 0.35 
ATOM 193 ND1HSCS112 11.533-34.176 -7.915 1.00-0.61 
ATOM 194 HD1 HSC S 112 11.881 -34.643 -8.703 1.00 0.48 
20 ATOM 195 CE1HSCS 112 10.215 -33.947 -7.619 1.00 0.72 
ATOM 196 NE2HSCS112 10.132-33.262 -6.434 1.00-0.69 
ATOM 197 HE2HSCS 112 9.308-32.971 -5.991 1.00 0.49 

ATOM 198 C HSC S 112 15.841 -32.262 -7.208 1.00 0.53 
ATOM 199 O HSC S 112 16.534-33.221 -6.836 1.00-0.50 

25 ATOM 200 N PHE S 113 16.299-31.017 -7.301 1.00-0.52 
ATOM 201 CA PHE S 113 17.678-30.673 -6.967 1.00 0.21 
ATOM 202 C PHE S 113 17.554-29.654 -5.860 1.00 0.53 
ATOM 203 O PHE S 113 18.444-29.545 -5.003 1.00-0.50 
ATOM 204 CB PHE S 113 18.418-30.047 -8.160 1.00 0.04 
30 ATOM 205 CG PHE S 113 17.666 -28.854 -8.702 1.00 0.01 
ATOM 206 CD1 PHE S 113 18.011 -27.554 -8.270 1.00-0.01 
ATOM 207 CD2PHES 113 16.627-29.064 -9.635 1.00-0.01 
ATOM 208 CE1 PHE S 1113 17.306 -26.447 -8.780 1.00 Q.00 
ATOM 209 CE2 PHE S 113 15.922-27.959-10.145 1.00 0.00 
35 ATOM 210 CZ PHE S 113 16.268-26.661 -9.712 1.00 0.00 
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ATOM 


211 


H PHE S 1 13 




ATOM 


212 


N THR S 114 




ATOM 


213 


CA THR S 1 14 




ATOM 


214 


C THR S 114 


5 


ATOM 


215 


O THR S 114 




ATOM 


216 


CB THR S 114 




ATOM 


217 


OG1 THR S 114 




ATOM 


218 


CG2 THR S 114 




ATOM 


219 


H THR S 114 


10 


ATOM 


220 


HG1 THR S 114 




ATOM 


221 


N PROS 115 




ATOM 


222 


CA PRO S 115 




ATOM 


-> o 

iij 


C PROS 115 




ATOM 


224 


O PROS 115 


15 


ATOM 


225 


CB PRO S 115 




ATOM 


226 


CG PRO S 115 




ATOM 


227 


CD PRO S 115 




ATOM 


228 


N ARG S 116 




ATOM 


229 


CA ARG S 116 


20 


ATOM 


230 


C ARG S 116 




ATOM 


231 


O ARG S 116 




ATOM 


232 


CB ARG S 116 




ATOM 


233 


CG ARG S 1 16 




ATOM 


234 


CD ARG S 116 


25 


ATOM 


235 


NE ARG S 116 




ATOM 


236 


CZ ARG S 116 




ATOM 


237 


NH1 ARG S 116 




ATOM 


238 


NH2 ARG S 116 




ATOM 


239 


H ARG S 116 


30 


ATOM 


240 


HE ARG S 116 




ATOM 


241 1HH1 ARG S 116 




ATOM 


242 2HH1 ARG S 116 




ATOM 


243 1HH2 ARG S 116 




ATOM 


244 2HH2 ARG S 116 


35 


ATOM 


245 


N LEU S 117 




ATOM 


246 


CA LEU S 117 



PCT/CA97/00539 

15.686-30.289 -7.608 1.00 0.25 
16.454-28.908 -5.874 1.00-0.52 
16.200-27.883 -4.866 1.00 0.27 
16.150-28.477 -3.478 1.00 0.53 
16.898-28.053 -2.585 1.00-0.50 
14.903-27.124 -5.185 1.00 0.21 

14.606-27.224 -6.576 1.00 -0.55 

14.967-25.648 -4.762 1.00 0.01 
15.774-29.046 -6.595 1.00 0.25 

13.824-26.691 -6.734 1.00 0.31 

15.270-29.457 -3.296 1.00-0.26 

15.113-30.124 -2.008 1.00 0.11 
*e iioo iti cQi -i cir» -i no n 

16.789-30.490 -0.343 1.00-0.50 

14.024-31.174 -2.277 1.00 0.00 

14.008-31.384 -3.788 1.00 0.04 

14.364-29.999 -4.305 1.00 0.08 
17.122-31.399 -2.390 1.00-0.52 

18.405-32.006 -2.048 1.00 0.24 
19.201 -32.059 -3.330 1.00 0.53 
18.642-32.307 -4.408 1.00-0.50 

18.212- 33.381 -1.388 1.00 0.05 

17.213- 34.284 -2.128 1.00 0.06 
17.881 -35.312 -3.055 1.00 0.11 
17.832-34.856 -4 444 1.00-0.49 
18.735-35.300 -5.346 1.00 0.81 

19.694-36.161 -5.009 1.00-0.63 
18.663-34.866 -6.600 1.00-0.63 
16.767-31.523 -3.317 1.00 0.25 
17.128-34.209 -4.737 1.00 0.29 
20.347-36.471 -5.701 1.00 0.36 
19.767-36.501 -4.072 1.00 0.36 
19.324-35.186 -7.279 1.00 0.36 
17.947-34.219 -6.864 1.00 0.36 
20.506-31.828 -3.216 1.00-0.52 
21.399-31.846 -4.370 1.00 0.23 
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5 



ATOM 


247 


C LEU S 117 


21.643-33.299 - 


4.741 1.00 0.50 


ATOM 


248 


O LEU S 117 


21.687-33.594 - 


5.933 1.00-0.25 


ATOM 


249 


CB LEU S 117 


22.706 -31.110 


-4.038 1.00 0.02 


ATOM 


250 


CG LEU S 117 


23.454 -31.672 


-2.818 1.00 0.05 


ATOM 


251 


CD1 LEU S 117 


24.851 -32.222 


-3.147 1.00-0.01 


ATOM 


252 


CD2 LEU S 117 


23.506 -30.647 


-1.674 1.00-0.01 


ATOM 


253 


OXTLEUS117 


21.785-34.114 


-3.834 1.00-0.25 


ATOM 


254 


H LEU S 117 


20.894-31.634 • 


-2.315 1.00 0.25 


END 











10 

•Principal residues are bolded. 
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Appendix 2. Geometry of the LRM-2B in PDB format.* 

REMARK 1 The bioactive conformation of the second LRM of TrkB 







2 








ATOM 




N LEU S 93 


32.287 14.723 17.944 1.00-0.45 




ATOM 




CA LEU S 93 


-33.186 15.831 17.679 1.00-0.07 




ATOM 


3 


C LEU S 93 - 


32.411 17.027 17.215 1.00 0.53 




A 1 UM 


4 


O LEU S 93 - 


31 203 17 150 17 385 1 00-0 50 




ATOM 


5 


CB LEUS 93 


-33.996 16.167 18.941 1.00 0.02 




ATOM 


6 


CG LEU S 93 


-35.339 15.426 19.042 1.00 0.05 


in 


ATOM * 


7 


CD1 LEU S 93 


-36.286 15.694 17.861 1.00-0.01 




ATOM 


8 


CD2 LEU S 93 


-35.131 13.920 19.266 1.00-0.01 




ATOM 


9 1H LEU S 93 


-31.777 14.490 17.116 1.00 0.15 




a TO* A 


in 


2H LEU S 93 


-32.810 13.922 18.239 1.00 0.15 




ATOM 


11 3H LEU S 93 


-31.641 14.971 18.665 1.00 0.15 


15 


ATOM 


12 


N ARG S 94 


-33.510 17.650 16.801 1.00-0.52 




ATOM 


13 


CA ARG S 94 


-33.484 19.028 16.320 1.00 0.24 




ATOM 


14 


C ARG S 94 


-34.175 19.857 17.377 1.00 0.53 




ATOM 


15 


O ARG S 94 


-33.609 20.841 17.874 1.00-0.50 




ATOM 


16 


CB ARG S 94 


-34.128 19.146 14.929 1.00 0.05 


20 


ATOM 


17 


CG ARG S 94 


-33.300 18.494 13.811 1.00 0.06 




ATOM 


18 


CD ARG S 94 


-34.051 18.379 12.475 1.00 0.11 




ATOM 


19 


NE ARG S 94 


-33.282 17.579 11.523 1.00-0.49 




ATOM 


20 


CZ ARG S 94 


-33.792 16.435 11.016 1.00 0.81 




ATOM 


21 


NH1 ARG S 94 


-35.003 15.996 11.360 1.00-0.63 


25 • 


ATOM 


22 


NH2 ARG S 94 


-33.068 15.730 10.153 1.00-0.63 




ATOM 


23 


H ARG S 94 


-34.386 17.167 16.817 1.00 0.25 




ATOM 


24 


HE ARG S 94 


-32.370 17.873 11.238 1.00 0.29 




ATOM 


25 1HH1 ARG S 94 


-35.352 15.145 10.968 1.00 0.36 




ATOM 


26 2HH1 ARG S 94 


-35.565 16.510 12.008 1.00 0.36 




ATOM 


27 1HH2 ARG S 94 


-33.433 14.882 9.769 1.00 0.36 




ATOM 


28 2HH2 ARG S 94 


-32.159 16.050 9.887 1.00 0.36 




ATOM 


29 


N ASN S 95 


-35.397 19.461 17.721 1.00-0.52 




ATOM 


30 


CA ASNS 95 


-36.183 20.166 18.729 1.00 0.22 




ATOM 


31 


C ASN S 95 


-37.332 19.318 19.219 1.00 0.53 


35 


ATOM 


32 


O ASN S 95 


-37.969 18.605 18.430 1.00-0.50 




ATOM 


33 


CB ASN S 95 


-36.706 21.522 18.229 1.00 0.00 
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ATOM 


34 CG ASNS 95 




ATOM 


35 OD1 ASN S 95 




ATOM 


36 ND2ASNS 95 




ATOM 


37 H ASN S 95 


5 


ATOM 


38 1HD2 ASN S 95 




ATOM 


39 2HD2ASNS 95 




ATOM 


40 N LEUS 96 




ATOM 


41 CA LEUS 96 




ATOM 


42 C LEUS 96 


10 


ATOM 


43 O LEU S 96 




ATOM 


44 CB LEUS 96 




ATOM 


45 CG LEUS 96 




ATOM 


46 CD1 LEU S 96 




ATOM 


47 CD2 LEU S 96 


15 


ATOM 


48 H LEUS 96 




ATOM 


49 N THRS 97 




ATOM 


50 CA THRS 97 




ATOM 


51 C THRS 97 




ATOM 


52 O THRS 97 


20 


ATOM 


53 CB THRS 97 




ATOM 


54 OG1 THR S 97 




ATOM 


55 CG2THRS 97 




ATOM 


56 H THRS 97 




ATOM 


57 HG1 THR S 97 


25 


ATOM 


58 N ILES 98 




ATOM 


59 CA ILES 98 




ATOM 


60 C ILES 98 




ATOM 


61 O ILES 98 




ATOM 


62 CB ILES 98 


30 


ATOM 


63 CG1 ILE S 98 




ATOM 


64 CG2 ILE S 98 




ATOM 


65 CD1 ILE S 98 




ATOM 


66 H ILES 98 




ATOM 


67 N VALS 99 


35 


ATOM 


68 CA VALS 99 




ATOM 


69 C VALS 99 



-36.363 22.606 19.233 1.00 0.68 
-37.231 23.137 19.913 1.00-0.47 
-35.056 22.909 19.294 1.00-0.87 
-35.797 18.657 17.281 1.00 0.25 
-34.725 23.610 19.925 1.00 0.34 
-34.397 22.435 18.709 1.00 0.34 
-37.598 19.391 20.520 1.00-0.52 
-38.679 18.626 21.134 1.00 0.20 
-39.965 19.383 20.901 1.00 0.53 
-40.388 20.181 21.751 1.00-0.50 
-38.396 18.419 22.630 1.00 0.02 
-37.665 17.107 22.956 1.00 0.05 
-38.413 15.848 22.489 1.00-0.01 
-36.219 17.129 22.436 1.00-0.01 
-37.044 19.987 21.103 1.00 0.25 
-40.588 19.136 19.753 1.00-0.52 
-41.840 19.794 19.394 1.00 0.27 
-42.809 18.826 18.759 1.00 0.53 
-42.400 17.929 18.007 1.00-0.50 
-41.572 20.990 18.466 1.00 0.21 
-42.795 21.657 18.160 1.00-0.55 
-40.854 20.578 17.171 1.00 0.01 
-40.193 18.480 19.110 1.00 0.25 
-42.562 22.479 17.722 1.00 0.31 
-44.093 19.003 19.059 1.00-0.52 
-45.140 18.142 18.518 1.00 0.20 
-46.361 18.894 18.043 1.00 0.53 
-46.834 19.818 18.721 1.00-0.50 
-45.444.17.014 19.516 1.00 0.03 
-46.244 15.862 18.888 1.00 0.02 
-46.123 17.547 20.788 1.00 0.00 
-45.346 14.810 18.219 1.00 0.00 
-44.356 19.747 19.673 1.00 0.25 
-46.872 18.500 16.880 1.00-0.52 
-48.051 19.135 16.298 1.00 0.20 
-49.253 18.226 16.395 1.00 0.53 
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ATOM 70 O VALS 99 -49.110 16.994 16.408 1.00-0.50 

ATOM 71 CB VALS 99 -47.698 19.525 14.854 1.00 0.03 

ATOM 72 CG1VALS 99 -46.688 20.681 14.780 1.00 0.01 

ATOM 73 CG2VALS 99 -47.287 18.331 13.978 1.00 0.01 

5 ATOM 74 H VALS 99 -46.441 17.747 16.383 1.00 0.25 

ATOM 75 N ASPS 100 -50.436 18.829 16.461 1.00-0.52 

ATOM 76 CA ASP S 100 -51.684 18.077 16.557 1.00 0.25 

ATOM 77 C ASPS 100 -51.849 17.392 15.222 1.00 0.53 

ATOM 78 O ASPS 100 -51.673 16.169 15.115 1.00-0.50 

10 ATOM 79 CB ASP S 100 -52.858 19.041 16.789 1.00-0.21 

ATOM 80 CG ASPS 100 -52.508 20.428 16.287 1.00 0.62 

ATOM 81 OD1 ASPS 100 -52.854 20.755 15.156 1.00-0.71 

ATOM 82 OD2 ASPS 100 -51.889 21.189 17.0Z3 1.00-0.71 

ATOM 83 H ASPS 100 -50.481 19.828 16.445 1.00 0.25 

15 ATOM 84 N SER S 101 -52.187 18.177 14.202 1.00-0.52 

ATOM 85 CA SER S 101 -52.382 17.652 12.854 1.00 0.29 

ATOM 86 C SER S 101 -53.017 16.282 12.846 1.00 0.53 

ATOM 87 O SER S 101 -52.684 15.439 12.000 1.00-0.50 

ATOM 88 CB SER S 101 -51.034 17.621 12.116 1.00 0.19 

20 ATOM 89 OG SER S 101 -51.236 17.656 10.705 1.00-0.55 

ATOM 90 H SER S 101 -52.314 19.157 14.356 1.00 0.25 

ATOM 91 HG SER S 101 -50.419 17.987 10.324 1.00 0.31 

ATOM 92 N GLY S 102 -53.930 16.058 13.786 1.00-0.52 

ATOM 93 CA GLY S 102 -54.626 14.779 13.900 1.00 0.25 

25 ATOM 94 C GLY S 102 -53.788 13.536 13.720 1.00 0.53 

ATOM 95 O GLY S 102 -54.199 12.596 1 3.024 1.00-0.50 

ATOM 96 H GLY S 102 -54.153 16.782 14.439 1.00 0.25 

ATOM 97 N LEU S 103 -52.615 13.527 14.347 1.00-0.52 

ATOM 98 CA LEU S 103 -51.701 12.392 14.263 1.00 0.20 

30 ATOM 99 C LEU S 103 -51.521 11.844 15.658 1.00 0.53 

ATOM 100 O LEU S 103 -51.586 12.596 16.643 1.00-0.50 

ATOM 101 CB LEU S 103 -50.367 12.833 13.641 1.00 0.02 

ATOM 102 CG LEU S 103 -49.757 14.089 14.284 1.00 0.05 

ATOM 103 CD1LEUS 103 -48.873 13.793 15.505 1.00-0.01 

35 ATOM 104 CD2LEUS 103 -49.015 14.944 13.244 1.00-0.01 

ATOM 105 H LEU S 103 -52.343 14.319 14.894 1.00 0.25 
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ATOM 106 N LYS S 104 

ATOM 107 CA LYS S 104 

ATOM 108 C LYS S 104 

ATOM 109 O LYS S 104 

5 ATOM 110 CB LYS S 104 

ATOM 111 CG LYS S 104 

ATOM 112 CD LYS S 104 

ATOM 113 CE LYS S 104 

ATOM 114 NZ LYS S 104 

10 ATOM 115 H LYS S 104 

ATOM 116 1 HZ LYS S 104 

ATOM 117 2HZ LYS S 104 

ATOM 118 3HZ LYS S 104 





ATOM 


119 


N PHE S 105 


15 


ATOM 


120 


CA PHE S 105 




ATOM 


121 


C PHE S 105 




ATOM 


122 


O PHE S 105 




ATOM 


123 


CB PHE S 105 




ATOM 


124 


CG PHE S 105 


20 


ATOM 


125 


CD1 PHE S 105 




ATOM 


126 


CD2 PHE S 105 




ATOM 


127 


CE1 PHE S 105 




ATOM 


128 


CE2 PHE S 105 




ATOM 


129 


CZ PHE S 105 


25 


ATOM 


130 


H PHE S 105 




ATOM 


131 


N VAL S 1 06 




ATOM 


132 


CA VAL S 106 




ATOM 


133 


C VAL S 106 




ATOM 


134 


O VAL S 106 


30 


ATOM 


135 


CB VAL S 106 




ATOM 


136 


CG1 VALS 106 




ATOM 


137 


CG2 VALS 106 




ATOM 


138 


H VAL S 106 




ATOM 


139 


N ALAS 107 


35 


ATOM 


140 


CA ALA S 107 




ATOM 


141 


C ALA S 107 



PCT/CA97/00539 

-51.296 10.536 15.747 1.00-0.52 
-51.102 9.870 17.032 1.00 0.23 
-49.623 9.824 17.329 1 00 0.53 
-49.013 8.744 17.326 1.00-0.50 
-51.738 8.471 17.005 1.00 0.04 
-52.308 8.008 18.355 1.00 0.05 
-53.074 6.679 18.249 1.00 0.05 
-52.171 5.461 18.503 1.00 0.22 
-52.891 4.233 18.143 1.00-0.27 
-51.256 9.986 14.913 1.00 0.25 
-53.368 4.364 17.227 1.00 0.31 
-53.600 4.018 18.873 1.00 0.31 
-52.219 3.443 18.071 1.00 0.31 
-49.043 10.993 17.585 1.00-0.52 
-47.619 11.101 17.889 1.00 0.21 
-46.933 10.882 16.562 1.00 0.53 
-46.495 11.844 15.914 1.00-0.50 
-47.162 10.024 18.886 1.00 0.04 
-45.861 10.412 19.549 1.00 0.01 
-44.683 9.692 19.251 1.00-0.01 
-45.849 11.493 20.457 1.00-0.01 
-43.474 10.058 19.873 1.00 0.00 
-44.641 11.860 21.078 1.00 0.00 
-43.466 11.139 20.780 1.00 0.00 
-49.592 11.829 17.571 1.00 0.25 
-46.838 9.619 16.156 1.00-0.52 
-46.200 9.259 14.893 1.00 0.20 
-44.818 9.859 14.800 1.00 0.53 
-44.261 9.989 13.699 1.00-0.50 
-47.136 9.709 13.760 1.00 0.03 
-46.649 9.261 12.372 1.00 0.01 
-48.612 9.356 14.005 1.00 0.01 
-47.213 8.889 16.726 1.00 0.25 
-44.263 10.225 15.951 1.00-0.52 
-42.930 10.820 16.013 1.00 0.22 
-42.034 9.846 16.740 1.00 0.53 
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ATOM 


142 


O ALA S 107 


-41.035 


10.247 17.354 1.00-0.50 




ATOM 


143 


CB ALA S 107 


-42.939 


12.156 16.772 1.00 0.03 




ATOM 


144 


H ALA S 107 


-44.766 


10.093 16.805 1.00 0.25 




ATOM 


145 


N TYR S 108 


-42.388 


8.565 16.671 1.00 -0.52 




ATOM 


146 


CA TYR S 108 


-41.616 


7.513 17.325 1.00 0.25 




ATOM 


147 


C TYR S 108 


-40.181 


7.445 16.861 1.00 0.53 




ATOM 


148 


O TYR S 108 


-39.320 


6.888 17.557 1.00-0.50 




ATOM 




CB TYR S 108 


-42.269 


6.142 17.090 1.00 0.02 




ATOM 




PO TYR S 108 


-43.761 


6 173 17 332 1 00 0 00 


10 


ATOM 


151 


CD1 TYR S 108 


-44.625 


6.208 16.220 1.00-0.04 




ATOM 


152 


CD2 TYR S 108 


-44.248 


6.166 18.656 1.00-0.04 




ATOM 


153 


CE1 TYR S 108 


-46.012 


6.237 16.436 1.00 0.10 




ATOM 


154 


CE2 TYR S 1 08 


-45.636 


6.196 18.871 1.00 0.10 




ATOM 


155 


CZ TYR S 108 


-46.500 


6.231 17.759 1.00-0.12 


15 


ATOM 


156 


OH TYR S 108 


-47.863 


6.259 17.970 1.00-0.37 




ATOM 


157 


H TYR S 108 


-43.207 


8.308 16.159 1.00 0.25 




ATOM 


158 


HH TYR S 108 


-48.199 


7.127 17.791 1.00 0.34 




ATOM 


159 


N LYS S 109 


-39.921 


8.011 15.685 1.00-0.52 




ATOM 


160 


CA LYS S 109 


-38.579 


8.021 15.112 1.00 0.23 


20 


ATOM 


161 


C LYS S 109 


-37.724 


8.976 15.910 1.00 0.53 




ATOM 


162 


O LYS S 109 


-36.497 


8.811 15.985 1.00-0.50 




ATOM 


163 


CB LYS S 109 


-38.641 


8.396 13.622 1.00 0.04 




ATOM 


164 


CG LYS S 109 


-39.200 


9.801 13.345 1.00 0.05 




ATOM 


165 


CD LYS S 109 


-40.553 


9.764 12.617 1.00 0.05 


25 


ATOM 


166 


CE LYS S 109 


-41.148 


11.167 12.416 1.00 0.22 




ATOM 


167 


NZ LYS S 109 


-42.394 


11.066 11.645 1.00-0.27 




ATOM 


168 


H LYS S 109 


-40.662 


8.444 15.173 1.00 0.25 




ATOM 


169 1HZ LYS S 109 


^*2.761 


12.020 11.449 1.00 0.31 




ATOM 


170 2HZ LYS S 109 


-43.099 


10.531 12.191 1.00 0.31 


30 


ATOM 


171 3 HZ LYS S 109 


-42.209 


10.576 10.746 1.00 0.31 




ATOM 


172 


N ALAS 110 


-38.368 


9.974 16.509 1.00-0.52 




ATOM 


173 


CA ALAS 110 


17 C71 
-Ol .Oil 






ATOM 


174 


C ALAS 110 


-37.673 


10.482 18.742 1.00 0.53 




ATOM 


175 


O ALAS 110 


-36.724 


10.741 19.497 1.00-0.50 


35 


ATOM 


176 


CB ALAS 110 


-38.368 


12.341 17.251 1.00 0.03 




ATOM 


177 


H ALAS 110 


-39.360 


10.050 16.409 1.00 0.25 



79 

SUBSTITUTE SHEET (RULE 26) 

BNSOOCID <WO 9B0604BA2J_> 



WO 98/06048 



PCT/CA97/00539 



ATOM 178 N PHE S 111 
ATOM 179 CA PHE S 111 
ATOM 180 C PHE S 111 
ATOM 181 O PHE S 111 
5 ATOM 182 CB PHE S 111 
ATOM 183 CG PHE S 111 
ATOM 184 CD1 PHE S 111 
ATOM 185 CD2 PHE S 111 
ATOM 186 CE1 PHE S 111 
10 ATOM 187 CE2PHES 111 
ATOM 188 CZ PHE S 111 
ATOM 189 H PHE S 111 
ATOM 190 N LEU 3 112 
ATOM 191 CA LEU S 112 
15 ATOM 192 C LEU S 112 
ATOM 193 O LEU S 112 
ATOM 194 CB LEU S 112 
ATOM 195 CG LEU S 112 
ATOM 196 CD1 LEU S 112 
20 ATOM 197 CD2LEUS 112 
ATOM 198 H LEU S 112 
ATOM 199 N LYS S 113 
ATOM 200 CA LYS S 113 
ATOM 201 C LYS S 113 
25 ATOM 202 O LYS S 113 
ATOM 203 CB LYS S 113 
ATOM 204 CG LYS S 113 
ATOM 205 CD LYS S 113 
ATOM 206 CE LYS S 113 
30 ATOM 207 NZ LYS S 113 
ATOM 208 H LYS S 113 
ATOM 209 1HZ LYS S 113 
ATOM 210 2HZ LYS S 113 
ATOM 211 3 HZ LYS S 113 
35 ATOM 212 N ASNS114 
ATOM 213 CA ASN S 114 



-38.736 9.775 19.114 1.00-0.52 
-38.873 9.238 20.465 1.00 0.21 
-37.731 8.259 20.599 1.00 0.53 
-37.123 8.144 21.673 1.00-0.50 
-40.207 8.499 20.656 1.00 0.04 
-40.461 8.200 22.115 1.00 0.01 
-41.035 9.191 22.942 1.00-0.01 
-40.116 6.930 22.626 1.00-0.01 
-41.269 8.906 24.301 1.00 0.00 
-40.349 6.645 23.983 1.00 0.00 
-40.924 7.636 24.807 1.00 0.00 
-39.467 9.602 18.455 1.00 0.25 
-37 439 7.552 19.511 1.00 -.0.52 
-36.360 6.568 19.494 1.00 0.20 
-35.074 7.293 19.811 1.00 0.53 
-34.249 6.798 20.594 1.00-0.50 
-36.305 5.868 18.127 1.00 0.02 
-35.374 4.646 18.082 1.00 0.05 
-36.074 3.321 18.421 1.00-0.01 
-34.628 4.563 16.740 1.00-0.01 
-37.970 7.692 18.675 1.00 0.25 
-34.900 8.464 19.205 1.00-0.52 
-33.705 9.275 19.417 1.00 0.23 
-33.633 9.650 20.878 1.00 0.53 
-32.578 9.508 21.513 1.00-0.50 
-33.728 10.505 18.496 1.00 0.04 
-33.318 10.217 17.043 1.00 0.05 
-33.954 11.198 16.045 1.00 0.05 
-33.790 10.739 14.587 1.00 0.22 
-34.498 9.467 14.390 1.00-0.27 
-35.605 8.807 18.585 1.00 0.25 
-35.239 9.365 15.113 1.00 0.31 
-33.826 8.678 14.472 1.00 0.31 
-34.933 9.453 13.446 1.00 0.31 
-34.754 10.128 21.412 1.00-0.52 
-34.832 10.531 22.813 1.00 0.22 
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ATOM 




C A^JNI S 1 14 


-34.508 9.379 23.733 1.00 0.53 




ATOM 


Oi f\ 


O ASN S 114 

\_/ null w • ' ~ 


-33.878 9.570 24.784 1.00-0.50 




A i KJWl 


£. ID 


rn ASM S 1 14 


-36.198 11.131 23.180 1.00 0.00 




A I Ulvi 


217 


CG ASN S 114 


-36.349 12.495 22.535 1.00 0.68 


b 


A I DM 


218 


OD1 ASN S 114 


-37.101 12.669 21.585 1.00-0.47 




ATOM 


219 


ND2 ASN S 114 


-35.597 13.457 23.097 1.00-0.87 




ATOM 


220 


H ASNS 114 


-35 572 10 217 20 844 1.00 0.25 




A TAH il 

ATOM 


221 1HD2 ASN S 114 


-35.623 14.393 22.746 1.00 0.34 




A I KJW\ 


222 2HD2 ASN S 114 


-35.002 13.248 23.873 1.00 0.34 


1 U 


A TOM 

A I vJM 


223 


N SERS115 


-34.937 8.182 23.342 1.00-0.52 




ATOM 


224 


CA SER S 115 


-34.696 6.977 24.131 1.00 0.29 




ATOM 
A I Ulvl 


225 


C SER S 115 


-33.240 6.580 24.160 1.00 0.53 




A I UM 


226 


. — r— i— i r% A A c 

<u ocn o i i ij 


oo tec c oi7 o«; i« 1 nn _n 




ATOM 


227 


CB SER S 115 


-35.553 5.825 23.584 1.00 0.19 


1 

I u 


ATOM 


228 


OG SER S 115 


-36.939 6.156 23.655 1.00-0.55 




ATOM 


229 


H SER S 115 


-35.440 8.096 22.482 1.00 0.25 




ATOM 


230 


HG SER S 115 


-37.081 6.859 23.017 1.00 0.31 




AT^^il 


231 


N ASNS 116 


-32.540 6.850 23.062 1.00-0.52 




A I \JWl 


232 


CA ASN S 116 


-31.121 6.522 22.949 1.00 0.22 


on 


ATOM 


233 


C ASNS 116 


-30.264 7.559 23.632 1.00 0.53 




ATOM 


234 


O ASNS 116 


-29.329 7.216 24.372 1.00-0.50 




ATOM 


235 


CB ASN S 116 


-30.671 6.343 21.490 1.00 0.00 




ATOM 


236 


CG ASN S 116 


-30.988 4.935 21.025 1.00 0.68 




ATOM 
A I UM 


237 


OD1 ASN S 116 


-30.139 4.053 21.046 1.00-0.47 


ZD 


ATOM 
A I Ulvl 


238 


ND2 ASN S 116 


-32.252 4.764 20.603 1.00-0.87 




ATOM 


239 


H ASNS 116 


-32.991 7.290 22.286 1.00 0.25 . 




ATOM 
A I \Jl\n 


240 1HD2 ASNS 116 


-32.564 3.872 20.275 1.00 0.34 




ATOK/I 


241 2HD2 ASN S 116 


-32.898 5.528 20.611 1.00 0.34 




ATAH 

A I Ulvl 


242 


N LEU S 117 


-30.578 8.828 23.387 1.00-0.52 


on 


ATOM 


243 


CA LEU S 117 


-29.835 9.937 23.980 1.00 0.23 




ATOM 


244 


C LEU S 117 


-30.840 10.862 24.645 1.00 0.50 




A I KJWi 


245 


O LEU S 117 


-31 725 11 354 23 951 1.00-0.25 




ATOM 


246 


CB LEU S 117 


-29.015 10.660 22.900 1.00 0.02 




ATOM 


247 


CG LEU S 117 


-27.751 9.905 22.458 1.00 0.05 


35 


ATOM 


248 


CD1 LEU S 117 


-27.875 9.246 21.075 1.00-0.01 




ATOM 


249 


CD2 LEU S 117 


-26.506 10.802 22.546 1.00-0.01 
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ATOM 250 OXTLEUS 117 -30.723 11.077 25.849 1.00-0.25 

ATOM 251 H LEU S 117 -31.344 9.038 22.780 1.00 0.25 
END 

•Principal residues are bolded. 



82 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9B06048A2_I_> 



WO 98/06048 PCT/CA97/00539 



Appendix 3. Geometry of the p75 NTR binding site for neurotrophic in PDB format. 

REMARK 1 Second CRD of p75 NTR , bound to NGF. 

5 REMARK 2 

ATOM 1 N CYSR 39 -32.488-11.161 6.915 1.00 0.00 
ATOM 2 CA CYSR 39 -33.024 -9.817 7.111 1.00 0.00 
ATOM 3 C CYS R 39 -33.595 -9.148 5.878 1.00 0.00 
ATOM 4 O CYSR 39 -34.221 -8.095 5.967 1.00 0.00 
10 ATOM 5 CB CYSR 39 -31.991 -8.907 7.780 1.00 0.00 
ATOM 6 SG CYSR 39 -30.272 -9.354 7.401 1.00 0.00 
ATOM 7 1H CYSR 39 -31.698-11.178 6.287 1.00 0.00 
. . . ,*wr% i-i no 4-m 1 1 coc 7 T7«; 1 nn n nn 

ATOM O Uio R J» -o^.. n w - ■ i — ....w . 

ATOM 93H CYSR 39 -33.166-11.799 6.523 1.00 0.00 

15 ATOM 10 N LEU R 40 -33.398 -9.779 4.707 1.00 0.00 

ATOM 11 CA LEUR 40 -34.095 -9.260 3.530 1.00 0.00 

ATOM 12 C LEUR 40 -35.322-10.097 3.220 1.00 0.00 

ATOM 13 O LEUR 40 -36.442 -9.632 3.041 1.00 0.00 

ATOM 14 CB LEUR 40 -33.148 -9.230 2.327 1.00 0.00 

20 ATOM 15 CG LEUR 40 -31.880 -8.404 2.561 1.00 0.00 

ATOM 16 CD1 LEUR 40 -30.862 -8.604 1.438 1.00 0.00 

ATOM 17 CD2LEUR 40 -32.184 -6.924 2.806 1.00 0.00 

ATOM 18 H LEUR 40 -32.867 -10.625 4.675 1.00 0.00 

ATOM 19 N ASPR 41 -35.034-11.407 3.188 1.00 0.00 

25 ATOM 20 CA ASPR 41 -36.070-12.408 2.947 1.00 0.00 

ATOM 21 C ASPR 41 -35.822-13.609 3.848 1.00 0.00 

ATOM 22 O ASPR 41 -34.792-13.673 4.525 1.00 0.00 

ATOM 23 CB ASPR 41 -36.054-12.772 1.449 1.00 0.00 

ATOM 24 CG ASPR 41 -37.195-13.703 1.074 1.00 0.00 

30 ATOM 25 0D1ASPR 41 -38.299-13.520 1.568 1.00 0.00 

ATOM 26 OD2ASPR 41 -36.979-14.633 0.314 1.00 0.00 

ATOM 27 H ASPR 41 -34.111 -11.739 3.385 1.00 0.00 

ATOM 28 N SERR 42 -36.757-14.563 3.825 1.00 0.00 

ATOM 29 CA SERR 42 -36.565-15.812 4.545 1.00 0.00 

35 ATOM 30 C SERR 42 -35.305-16.572 4.150 1.00 0.00 

ATOM 31 O SERR 42 -34.755-17.350 4.918 1.00 0.00 
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ATOM 32 CB SERR 42 -37.816-16.678 4.398 1.00 0.00 

ATOM 33 OG SER R 42 -38.988-15.930 4.780 1.00 0.00 

ATOM 34 H SERR 42 -37.497-14.449 3.168 1.00 0.00 

ATOM 35 HG SERR 42 -39.404-15.604 3.984 1.00 0.00 

ATOM 36 N VALR 43 -34.794-16.284 2.940 1.00 0.00 

ATOM 37 CA VALR 43 -33.503-16.883 2.574 1.00 0.00 

ATOM 38 C VALR 43 -32.270-16.339 3.305 1.00 0.00 

ATOM 39 O VALR 43 -31.128-16.743 3.081 1.00 0.00 

ATOM 40 CB VALR 43 -33.300-16.823 1.054 1.00 0.00 

ATOM 41 CG1VALR 43 -34.389-17.623 0.337 1.00 0.00 

ATOM 42 CG2VALR 43 -33.208-15.383 0.537 1.00 0.00 

ATOM 43 H VALR 43 -35.321 -15.679 2.336 1.00 0.00 

ATOM 44 N THR R 44 -32.539-15.360 4.166 1.00 0.00 

ATOM 45 CA THRR 44 -31.443-14.732 4.879 1.00 0.00 

ATOM 46 C THRR 44 -31.701 -14.680 6.367 1.00 0.00 

ATOM 47 O THRR 44 -32.659-14.063 6.830 1.00 0.00 

ATOM 48 CB THRR 44 -31.171 -13.339 4.300 1.00 0.00 

ATOM 49 OG1THRR 44 -32.382-12.578 4.130 1.00 0.00 

ATOM 50CG2THRR 44 -30.430-13.431 2.962 1:00 0.00 

ATOM 51 H THRR 44 -33.481 -15.090 4.359 1.00 0.00 

ATOM 52 HG1THRR 44 -33.031 -12.981 4.718 1.00 0.00 

ATOM 53 N PHER 45 -30.805-15.368 7.080 1.00 0.00 

ATOM 54 CA PHER 45 -30.829-15.256 8.526 1.00 0.00 





ATOM 


55 


C PHER 45 


-30.085 -13.996 


8.932 1.00 0.00 


25 


ATOM 


56 


O PHER 45 


-29.293 -13.430 


8.175 1.00 0.00 




ATOM 


57 


CB PHER 45 


-30.243 -16.531 


9.157 1.00 0.00 




ATOM 


58 


CG PHE R 45 


-30.514-16.587 


10.647 1.00 0.00 




ATOM 


59 


CD1 PHE R 45 


-31.844 -16.581 


11.126 1.00 0.00 




ATOM 


60 


CD2 PHE R 45 


-29.423 -16.628 


11.539 1.00 0.00 


30 


ATOM 


61 


CE1 PHE R 45 


-32.085 -16.592 


12.512 1.00 0.00 




ATOM 


62 


CE2 PHE R 45 


-29.661 -16.637 


12.927 1.00 0.00 




ATOM 


63 


CZ PHE R 45 


-30.989-16.611 


13.400 1.00 0.00 




ATOM 


64 


H PHER 45 


-30.025 -15.788 


6.623 1.00 0.00 




ATOM 


65 


N SERR 46 


-30.382-13.575 


10.161 1.00 0.00 


35 


ATOM 


66 


CA SER R 46 


-29.961 -12.259 


10.626 1.00 0.00 




ATOM 


67 


C SERR 46 


-28.732 -12.274 


11.517 1.00 0.00 
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ATOM 68 O SER R 46 
ATOM 69 CB SER R 46 
ATOM 70 OG SER R 46 
ATOM 71 H SER R 46 
5 ATOM 72 HG SER R 46 
ATOM 73 N ASP R 47 
ATOM 74 CA ASP R 47 
ATOM 75 C ASP R 47 
ATOM 76 O ASP R 47 
10 ATOM 77 CB ASP R 47 
ATOM 78 CG ASP R 47 
ATOM 79 OD1 ASP R 47 
ATOM 80 OD2 ASP R 47 
ATOM 81 H ASP R 47 
15 ATOM 82 N VAL R 48 
ATOM 83 CA VAL R 48 
ATOM 84 C VAL R 48 
ATOM 85 O VAL R 48 
ATOM 86 CB VAL R 48 
20 ATOM 87 CG1 VAL R 48 
ATOM 88 CG2VALR 48 
ATOM 89 H VAL R 48 
ATOM 90 N VAL R 49 
ATOM 91 CA VAL R 49 
25 ATOM 92 C VAL R 49 
ATOM 93 O VAL R 49 
ATOM 94 CB VAL R 49 
ATOM 95 CG1 VAL R 49 
ATOM 96 CG2 VAL R 49 
30 ATOM 97 H VAL R 49 
ATOM 98 N SER R 50 
ATOM 99 CA SER R 50 
ATOM 100 C SER R 50 
ATOM 101 O SER R 50 
* 35 ATOM 102 CB SER R 50 
ATOM 103 OG SERR 50 



-28.497-11.407 12.346 1.00 0.00 
-31.156-11.630 11.337 1.00 0.00 
-31.890-12.664 12.026 1.00 0.00 
-31.046-14.080 10.714 1.00 0.00 

-32.811 -12.512 11.820 1.00 0.00 
-27.906-13.314 11.317 1.00 0.00 
-26.772-13.543 12.226 1.00 0.00 
-25.567-12.610 12.077 1.00 0.00 
-24.410-13.027 12.086 1.00 0.00 
-26.337-15.025 12.168 1.00 0.00 
-25.807 -15.429 10.794 1.00 0.00 
-26.456-15.180 9.790 1.00 0.00 
-24.712 -f 5.961 10.694 1.00 0.00 
-28.048-13.919 10.531 1.00 0.00 
-25.857-11.312 11.875 1.00 0.00 
-24.733-10.395 11.691 1.00 0.00 
-24.130 -9.906 13.003 1.00 0.00 
-22.914 -9.763 13.131 1.00 0.00 
-25.108 -9.258 10.720 1.00 0.00 
-26.299 -8.421 11.194 1.00 0.00 
-23.886 -8.416 10.349 1.00 0.00 
-26.767-10.954 12.073 1.00 0.00 
-25.034 -9.713 13.984 1.00 0.00 
-24.767 -9.685 15.431 1.00 0.00 
-23.506 -9.045 16.022 1.00 0.00 
-23.103 -9.361 17.142 1.00 0.00 
-25.003-11.080 16.043 1.00 0.00 
-26.440-11.541 15.783 1.00 0.00 
-23.973-12.125 15.594 1.00 0.00 
-25.987 -9.707 13.681 1.00 0.00 
-22.912 -8.118 15.261 1 00 0.00 
-21.729 -7.376 15.719 1 00 0.00 
-21.546 -6.123 14.873 1.00 0.00 
-21.505 -4.977 15.336 1.00 0.00 
-20.431 -8.205 15.623 1.00 0.00 
-20.525 -9.489 16.273 1.00 0.00 
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ATOM 104 H SERR 50 -23.312 -7.897 14.372 1.00 0.00 

ATOM 105 HG SERR 50 -21.302 -9.403 16.827 1.00 0.00 

ATOM 106 N ALAR 51 -21.454 -6.446 13.572 1.00 0.00 

ATOM 107 CA ALAR 51 -21.546 -5.403 12.562 1.00 0.00 

ATOM 108 C ALAR 51 -23.013 -5.196 12.263 1.00 0.00 

ATOM 109 O ALAR 51 -23.862 -5.866 12.848 1.00 0.00 

ATOM 110 CB ALAR 51 -20.808 -5.825 11.288 T.00 0.00 

ATOM 111 H ALAR 51 -21.633 -7.392 13.330 1.00 0.00 

ATOM 112 N THRR 52 -23.266 -4.259 11 348 1.00 0.00 
ATOM 113 CA THRR 52 -24.806 -3.698 11.228 1.00 0.00 

ATOM 114 C THRR 52 -25.679 -4.557 10.579 1.00 0.00 

ATOM 115 O THRR 52 -26.564 -5.091 11.239 1.00 0.00 

.. ^..r,n en .0 344 10.532 1.00 0.00 

ATOM 1Tb" CD « nr\ r\ ji . — 

ATOM 117 OG1 THRR 52 -23.205 -1.766 10.845 1.00 0.00 
ATOM 118 CG2THRR 52 -25.624 -1.389 10.891 1 00 0.00 
ATOM 119 H THRR 52 -22.533 -3.835 10.820 1.00 0.00 
ATOM 120 HG1 THRR 52 -23.136 -0.977 10.314 1.00 0.00 
ATOM 121 N GLUR 53 -25.602 -4.625 9.241 1.00 0.00 
ATOM 122 CA GLUR 53 -26.765 -5.151 8.526 1.00 0.00 
ATOM 123 C GLUR 53 -26.656 -6.149 7.362 1.00 0.00 
ATOM 124 O GLUR 53 -27.702 -6.492 6.808 1.00 0.00 
ATOM 125 CB GLUR 53 -27.678 -3.966 8.146 1.00 0.00 
ATOM 126 CG GLUR 53 -27.452 -3.289 6.779 1.00 0.00 
ATOM 127 CD GLUR 53 -25.983 -3.036 6.488 1.00 0.00 
ATOM 128 OE1 GLUR 53 -25.530 -3.408 5.413 1.00 0.00 
ATOM 129 OE2GLUR 53 -25.284 -2.484 7.329 1.00 0.00 
ATOM 130 H GLUR 53 -24.901 -4.146 8.704 1.00 0.00 
ATOM 131 N PROR 54 -25.448 -6.634 6.953 1-00 0.00 
ATOM 132 CA PROR 54 -25.503 -7.661 5.912 1.00 0.00 
ATOM 133 C PROR 54 -26.227 -8.929 6.351 1 00 0.00 
ATOM 134 O PROR 54 -26.254 -9.343 7.509 1.00 0.00 
ATOM 135 CB PROR 54 -24.032 -7.892 5.550 1.00 0.00 
ATOM 136 CG PROR 54 -23.286 -6.653 6.033 1.00 0.00 
ATOM 137 CD PRO R 54 -24.070 -6.275 7.278 1.00 0.00 
ATOM 138 N CYSR 55 -26.841 -9.564 5.355 1.00 0.00 
ATOM 139 CA CYSR 55 -27.561 -10.782 5.715 1.00 0.00 



86 



9806O4SA2J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06048 



PCT/CA97/00539 





ATOM 


140 


C CYS R 55 




ATOM 


141 


O CYS R 55 




ATOM 


142 


CB CYS R 55 




ATOM 


143 


SG CYS R 55 


5 


ATOM 


144 


H CYS R 55 




ATOM 


145 


N LYS R 56 




ATOM 


146 


CA LYS R 56 




ATOM 


147 


C LYS R 56 




ATOM 


148 


O LYS R 56 


10 


ATOM 


149 


CB LYS R 56 




ATOM 


150 


CG LYS R 56 




ATOM 


151 


CD LYS R 56 




ATOM 


152 


CE LYS R 56 




ATOM 


153 


NZ LYS R 56 


15 


ATOM 


154 


H LYS R 56 




ATOM 


155 1HZ LYS R 56 




ATOM 


156 2HZ LYS R 56 




ATOM 


157 3 HZ LYS R 56 




ATOM 


158 


N PRO R 57 


20 


ATOM 


159 


CA PRO R 57 




ATOM 


160 


C PROR 57 




ATOM 


161 


O PROR 57 




ATOM 


162 


CB PRO R 57 




ATOM 


163 


CG PROR 57 


25 


ATOM 


164 


CD PRO R 57 




ATOM 


165 


N CYSR 58 




ATOM 


166 


CA CYS R 58 




ATOM 


167 


C CYSR 58 




ATOM 


168 


O CYSR 58 


30 


ATOM 


169 


CB CYS R 58 




ATOM 


170 


SG CYS R 58 




ATOM 


171 


H CYS R 58 




ATOM 


172 


N THR R 59 




ATOM 


173 


CA THR R 59 


35 


ATOM 


174 


C THR R 59 




ATOM 


175 


O THR R 59 



-26.758-12.020 5.403 1.00 0.00 
-25.722-11.942 4.747 1.00 0.00 
-28.884-10.816 4.980 1.00 0.00 
-29.966 -9.423 5.373 1.00 0.00 
-26.798 -9.206 4.422 1.00 0.00 
-27.264-13.165 5.871 1.00 0.00 
-26.589-14.362 5.398 1.00 0.00 
-27.549-15.386 4.837 1.00 0.00 
-28.519-15.789 5.477 1.00 0.00 
-25.714-14.916 6.522 1.00 0.00 
-24.805-16.085 6.153 1.00 0.00 
-23.591 -16.147 7.077 1.00 0.00 
-23.022-17.552 7.202 1.00 0.00 
-23.951 -18.331 8.015 1.00 0.00 
-28.051 -13.262 6.482 1.00 0.00 
-23.881 -18.006 8.995 1.00 0.00 
-23.728-19.338 7.946 1.00 0.00 
-24.928-18.196 7.691 1.00 0.00 
-27.236-15.830 3.598 1.00 0.00 
-27.952-16.989 3.057 1.00 0.00 
-27.718-18.237 3.898 1.00 0.00 
-26.590-18.524 4.312 1.00 0.00 
-27.397-17.079 1.631 1.00 0.00 
-26.018-16.418 1.682 1.00 0.00 
-26.218-15.301 2.698 1.00 0.00 
-28.842-18.939 4.129 1.00 0.00 
-28.833-20.145 4.956 1.00 0.00 
-28.246-21.406 4.309 1.00 0.00 
-27.067-21.414 3.958 1.00 0.00 
-30.219-20.336 5.550 1.00 0.00 
-31.446-20.570 4.242 1.00 0.00 
-29.694-18.544 3.776 1.00 0.00 
-29.040-22.485 4.171 1.00 0.00 
-28.405-23.777 3.884 1.00 0.00 
-29.331 -24.758 3.170 1.00 0.00 
-30.560-24.647 3.205 1.00 0.00 
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ATOM 176CBTHRR 59 -27.872-24.385 5.190 1.00 0.00 

ATOM 177 OG1THRR 59 -27.853-23.393 6.235 1.00 0.00 

ATOM 178 CG2THRR 59 -26.489-25.021 5.023 1.00 0.00 

ATOM 179 H THRR 59 -30.024-22.480 4.346 1.00 0.00 

5 ATOM 180 HG1THRR 59 -27.610-23.881 7.025 1.00 0.00 

ATOM 181 N GLUR 60 -28.701 -25.728 2.496 1.00 0.00 

ATOM 182 CA GLUR 60 -29.419-26.674 1.647 1.00 0.00 

ATOM 183 C GLUR 60 -30.161 -27.753 2.390 1.00 0.00 

ATOM 184 O GLUR 60 -29.788-28.927 2.459 1.00 0.00 

0 ATOM 185 CB GLUR 60 -28.473-27.229 0.578 1.00 0.00 

ATOM 186 CG GLUR 60 -28.436-26.362 -0.692 1.00 0.00 

ATOM 187 CD GLUR 60 -28.332-24.886 -0.343 1.00 0.00 
A-rrm/i mp ncir:iiiR RD -29.269-24.151 -0.619 1.00 0.00 

r~v i wi vi iww w w • — «■ » ■ » 

ATOM 189 OE2GLUR 60 -27.340-24.471 0.243 1.00 0.00 

5 ATOM 190 H GLUR 60 -27.701 -25.728 2.526 1.00 0.00 

ATOM 191 N CYSR 61 -31.270-27.273 2.945 1.00 0.00 

ATOM 192 CA CYS R 61 -32.170-28.156 3.661 1.00 0.00 

ATOM 193 C CYSR 61 -33.085-28.944 2.755 1.00 0.00 

ATOM 194 O CYSR 61 -34.299-28.751 2.709 1.00 0.00 

10 ATOM 195 CB CYSR 61 -32.910-27.347 4.714 1.00 0.00 

ATOM 196 SG CYSR 61 -31.740-26.942 6.031 1.00 0.00 

ATOM 197 H CYSR 61 -31.509-26.320 2.772 1.00 0.00 

ATOM 198 N VALR 62 -32.425-29.827 1.999 1.00 0.00 

ATOM 199 CA VALR 62 -33.134-30.711 1.091 1.00 0.00 

25 ATOM 200 C VALR 62 -32.620-32.122 1.268 1.00 0.00 

ATOM 201 O VALR 62 -31.507-32.351 1.744 1.00 0.00 
ATOM 202 CB VALR 62 -32.966-30.257 -0.371 1.00 0.00 
ATOM 203 CG1 VALR 62 -33.771 -28.988 -0.662 1.00 0.00 
ATOM 204 CG2VALR 62 -31.491 -30.099 -0.762 1.00 0.00 

30 ATOM 205 H VALR 62 -31.439-29.935 2.116 1.00 0.00 

ATOM 206 N GLY R 63 -33.482-33.057 0.831 1.00 0.00 
ATOM 207 CA GLY R 63 -33.115-34.472 0.773 1.00 0.00 
ATOM 208 C GLY R 63 -32.332-34.987 1.965 1.00 0.00 
ATOM 209 O GLY R 63 -32.817 -35.076 3.086 1.00 0.00 
35 ATOM 210 H GLY R 63 -34.346-32.742 0.449 1.00 0.00 
ATOM 211 N LEU R 64 -31.064-35.302 1.662 1.00 0.00 
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ATOM 212 CA LEU R 64 
ATOM 213 C LEU R 64 
ATOM 214 O LEUR 64 
ATOM 215 CB LEU R 64 
5 ATOM 216 CG LEU R 64 
ATOM 217 CD1 LEU R 64 
ATOM 218 CD2 LEU R 64 
ATOM 219 H LEUR 64 
ATOM 220 N GLN R 65 
10 ATOM 221 CA GLN R 65 
ATOM 222 C GLN R 65 
ATOM 223 O GLN R 65 
ATOM 224 CB GLN R 65 
ATOM 225 CG GLN R 65 
15 ATOM 226 CD GLN R 65 
ATOM 227 OE1 GLN R 65 
ATOM 228 NE2 GLN R 65 
ATOM 229 H GLN R 65 
ATOM 230 1HE2 GLN R 65 
20 ATOM 231 2HE2 GLN R 65 
ATOM 232 N SER R 66 
ATOM 233 CA SER R 66 
ATOM 234 C SER R 66 
ATOM 235 O SER R 66 
25 ATOM 236 CB SER R 66 
ATOM 237 OG SER R 66 
ATOM 238 H SER R 66 
ATOM 239 HG SER R 66 
ATOM 240 N MET R 67 
30 ATOM 241 CA MET R 67 
ATOM 242 C MET R 67 
ATOM 243 O MET R 67 
ATOM 244 CB MET R 67 
ATOM 245 CG MET R 67 
. 35 ATOM 246 SD MET R 67 
ATOM 247 CE MET R 67 



-30.174-35.897 2.660 1.00 0.00 
-29.939-35.063 3.905 1.00 0.00 
-29.748-35.602 4.993 1.00 0.00 
-28.818-36.268 2.047 1.00 0.00 
-28.759-37.594 1.273 1.00 0.00 
-29.223-38.770 2.135 1.00 0.00 
-29.458-37.548 -0.089 1.00 0.00 
-30.789-35.155 0.715 1.00 0.00 
-29.920-33.749 3.667 1.00 0.00 
-29.848-32.806 4.767 1.00 0.00 
-31.138-32.025 4.898 1.00 0.00 
-31.327-30.896 4.444 1.00 0.00 
-28.607-31.923 4.628 1.00 0.00 
-27.300-32.698 4.852 1.00 0.00 
-27.227-33.242 6.271 1.00 0.00 
-28.016-32.916 7.148 1.00 0.00 
-26.211 -34.066 6.484 1.00 0.00 
-30.093-33.389 2.753 1.00 0.00 
-26.039-34.449 7.390 1.00 0.00 
-25.596-34.408 5.773 1.00 0.00 
-32.061 -32.731 5.554 1.00 0.00 
-33.390-32.190 5.816 1.00 0.00 
-33.405-31.182 6.954 1.00 0.00 
-32.729-31.340 7.966 1.00 0.00 
-34.355-33.332 6.140 1.00 0.00 
-34.545-34.205 5.013 1.00 0.00 
-31.773-33.614 5.922 1.00 0.00 
-33.695-34.243 4.566 1.00 0.00 
-34.238-30.147 6.731 1.00 0.00 
-34.452-29.049 7.683 1.00 0.00 
-34.580-29.454 9.146 1.00 0.00 
-35.591 -30.030 9.547 1.00 0.00 
-35.701 -28.290 7.216 1.00 0.00 
-36.123-27.081 8.053 1.00 0.00 
-37.648-26.329 7.454 1.00 0.00 
-37.010-25.683 5.898 1.00 0.00 
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ATOM 


248 


H MET R 67 




ATOM 


249 


N SER R 68 




ATOM 


250 


CA SER R 68 




ATOM 


251 


C SER R 68 


5 


ATOM 


252 


O SER R 68 




ATOM 


253 


CB SERR 68 




ATOM 


254 


OG SER R 68 




ATOM 


255 


H SER R 68 




ATOM 


256 


HG SER R 68 


10 


ATOM 


257 


N ALA R 69 




ATOM 


258 


CA ALA R 69 




ATOM 


259 


C ALAR 69 




ATOM 


260 


O ALAR 69 




ATOM 


261 


CB ALA R 69 


15 


ATOM 


262 


H ALA R 69 




ATOM 


263 


N PRO R 70 




ATOM 


264 


CA PROR 70 




ATOM 


265 


C PRO R 70 




ATOM 


266 


O PRO R 70 


20 


ATOM 


267 


CB PRO R 70 




ATOM 


268 


CG PRO R 70 




ATOM 


269 


CD PRO R 70 




ATOM 


270 


N CYSR 71 




ATOM 


271 


CA CYS R 71 


25 


ATOM 


272 


C CYSR 71 




ATOM 


273 


O CYS R 71 




ATOM 


274 


CB CYS R 71 




ATOM 


275 


SG CYS R 71 




ATOM 


276 


H CYSR 71 


30 


ATOM 


277 


N VALR 72 




ATOM 


278 


CA VAL R 72 




ATOM 


279 


C VALR 72 




ATOM 


280 


O VALR 72 




ATOM 


281 


CB VALR 72 


35 


ATOM 


282 


CG1 VAL R 72 




ATOM 


283 


CG2 VAL R 72 



-34.671 -30.083 5.833 1.00 0.00 
-33.535-29.137 9.905 1.00 0.00 
-33.655-29.183 11.360 1.00 0.00 
-34.023-27.827 11.919 1.00 0.00 
-34.862 -27.691 12.800 1.00 0.00 
-32.358-29.637 12.009 1.00 0.00 
-31.643-30.509 11.115 1.00 0.00 
-32.761 -28.695 9.455 1.00 0.00 

-31.558-30.009 10.294 1.00 0.00 
-33.375-26.830 11.302 1.00 0.00 
-33.917-25.487 11.420 1.00 0.00 
-34.131 -24.964 10.022 1.00 0.00 
-33.406-25.351 9.101 1.00 0.00 
-32.949-24.567 12.163 1.00 0.00 
-32.643-27.065 10.660 1.00 0.00 
-35.166-24.103 9.901 1.00 0.00 
-35.349-23.341- 8.665 1.00 0.00 
-34.275-22.271 8.587 1.00 0.00 
-33.227-22.350 9.229 1.00 0.00 
-36.770-22.793 8.852 1.00 0.00 
-36.928-22.604 10.359 1.00 0.00 
-36.166-23.793 10.921 1.00 0.00 
-34.550-21.233 7.785 1.00 0.00 
-33.601 -20.119 7.809 1.00 0.00 
-33.970-19.007 8.772 1.00 0.00 
-33.266-18.019 8.957 1.00 0.00 
-33.418-19.571 6.407 1.00 0.00 
-33.225-20.894 5.186 1.00 0.00 
-35.393-21.195 7.252 1.00 0.00 
-35.153-19.212 9.378 1.00 0.00 
-35.518-18.365 10.505 1.00 0.00 
-35.220-19.134 11.781 1.00 0.00 
-34.442-20.088 11.731 1.00 0.00 
-36.968-17.871 10.369 1.00 0.00 
-37.035-16.768 9.311 1.00 0.00 
-37.963-18.992 10.064 1.00 0.00 
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ATOM 


284 


H VALR 72 




ATOM 


285 


N GLU R 73 




ATOM 


286 


CA GLU R 73 




ATOM 


287 


C GLU R 73 


5 


ATOM 


288 


O GLU R 73 




ATOM 


289 


CB GLUR 73 




ATOM 


290 


CG GLU R 73 




ATOM 


291 


CD GLU R 73 




ATOM 


292 


OE1 GLU R 73 


10 


ATOM 


293 


OE2 GLU R 73 




ATOM 


294 


H GLU R 73 




ATOM 


295 


N ALAR 74 




ATOM 


296 


CA ALAR 74 




ATOM 


297 


C ALA R 74 


15 


ATOM 


298 


O ALA R 74 




ATOM 


299 


CB ALAR 74 




ATOM 


300 


H ALA R 74 




ATOM 


301 


N ASP R 75 




ATOM 


302 


CA ASP R 75 


20 


ATOM 


303 


C ASPR 75 




ATOM 


304 


O ASP R 75 




ATOM 


305 


CB ASP R 75 




ATOM 


306 


CG ASP R 75 




ATOM 


307 


OD1 ASP R 75 


25 


ATOM 


308 


OD2ASPR 75 




ATOM 


309 


H ASP R 75 




ATOM 


310 


N ASP R 76 




ATOM 


311 


CA ASP R 76 




ATOM 


312 


C ASPR 76 


30 


ATOM 


313 


O ASPR 76 




ATOM 


314 


CB ASP R 76 




ATOM 


315 


CG ASP R 76 




ATOM 


316 


OD1 ASP R 76 




ATOM 


317 


OD2 ASP R 76 


35 


ATOM 


318 


H ASPR 76 




ATOM 


319 


N ALAR 77 



-35.684-20.042 9.226 1.00 0.00 
-35.790-18.668 12.904 1.00 0.00 
-35.488-19.248 14.217 1.00 0.00 
-34.000-19.471 14.515 1.00 0.00 
-33.279-18.511 14.794 1.00 0.00 
-36.399-20.461 14.483 1.00 0.00 
-37.842-20.111 14.899 1.00 0.00 
-38.628-19.399 13.805 1.00 0.00 
-39.196-20.066 12.948 1.00 0.00 
-38.693-18.173 13.816 1.00 0.00 
-36.613-18.100 12.819 1.00 0.00 
-33.549-20.733 14.428 1.00 0.00 
-32.139-20.969 14.723 1.00 0.00 
-31.209-20.464 13.628 1.00 0.00 
-31.627-19.858 12.643 1.00 0.00 
-31.896-22.456 14.994 1.00 0.00 
-34.142-21.443 14.054 1.00 0.00 
-29.901 -20.717 13.841 1.00 0.00 
-28.988-20.165 12.838 1.00 0.00 
-28.978-20.946 11.540 1.00 0.00 
-28.907-22.175 11.568 1.00 0.00 
-27.570-19.881 13.412 1.00 0.00 
-26.502-20.949 13.148 1.00 0.00 
-25.764-20.841 12.170 1.00 0.00 
-26.375-21.893 13.918 1.00 0.00 
-29.610-21.308 14.596 1.00 0.00 
-29.031 -20.167 10.438 1.00 0.00 
-28.563-20.650 9.133 1.00 0.00 
-28.887-22.099 8.799 1.00 0.00 
-28.004-22.960 8.851 1.00 0.00 
-27.048-20.433 9.048 1.00 0.00 
-26.651 -19.628 7.832 1.00 0.00 
-26.280-20.197 6.821 1.00 0.00 
-26.SS7 -18.412 7.895 1.00 0.00 
-29.310-19.218 10.546 1.00 0.00 
-30.181 -22.342 8.504 1.00 0.00 
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ATOM 320 CA ALA R 77 

ATOM 321 C ALAR 77 

ATOM 322 O ALAR 77 

ATOM 323 CB ALAR 77 

5 ATOM 324 H ALAR 77 

ATOM 325 N VAL R 78 

ATOM 326 CA VAL R 78 

ATOM 327 C VAL R 78 

ATOM 328 O VAL R 78 

10 ATOM 329 CB VAL R 78 

ATOM 330 CG1 VAL R 78 

ATOM 331 CG2 VAL R 78 

ATOM 332 H VAL R 78 

M I VJIVI OJJ IN UTSK fa 

15 ATOM 334 CA CYS R 79 

ATOM 335 C CYS R 79 

ATOM 336 O CYS R 79 

ATOM 337 CB CYS R 79 

ATOM 338 SG CYS R 79 

20 ATOM 339 H CYS R 79 

ATOM 340 N ARG R 80 

ATOM 341 CA ARG R 80 

ATOM 342 C ARG R 80 

ATOM 343 O ARG R 80 

25 ATOM 344 CB ARG R 80 

ATOM 345 CG ARG R 80 

ATOM 346 CD ARG R 80 

ATOM 347 NE ARG R 80 

ATOM 348 CZ ARG R 80 

30 ATOM 349 NH1 ARG R 80 

ATOM 350 NH2 ARG R 80 

ATOM 351 H ARG R 80 

ATOM 352 HE ARG R 80 

ATOM 353 1HH1 ARG R ®0 

35 ATOM 3§4 2HH1 ARG R SO 

ATOM 355 1HH2 ARG R 80 



-30.727-23.675 8.210 1.00 0.00 
-29.903-24.882 8.622 1.00 0.00 
-28.947-25.313 7.977 1.00 0.00 
-31.146-23.786 6.745 1.00 0.00 
-30.762-21.532 8.486 1.00 0.00 
-30.293-25.370 9.804 1.00 0.00 
-29.473-26.437 10.365 1.00 0.00 
-29.674-27.730 9.608 1.00 0.00 
-30.689-28.419 9.719 1.00 0.00 
-29.705-26.621 11.875 1.00 0.00 
-28.824-27.731 12.461 1.00 0.00 
-29.474-25.311 12.630 1.00 0.00 
-31.196-25.121 10.147 1.00 0.00 
-28.632-28.038 8.838 1.00 0.00 
-28.713-29.247 8.033 1.00 0.00 
-27.339-29.828 7.920 1.00 0.00 
-26.525-29.376 7.119 1.00 0.00 
-29.320-28.928 6.669 1.00 0.00 
-31.114-28.721 6.825 1.00 0.00 
-27.953-27.324 8.640 1.00 0.00 
-27.136-30.778 8.851 1.00 0.00 
-25.874-31.473 9.118 1.00 0.00 
-26.145-32.410 10.286 1.00 0.00 
-27.174-32.258 10.947 1.00 0.00 
-24.794-30.457 9.480 1.00 0.00 
-23.510 -30.612 8.671 1.00 0.00 
-23.609 -30.483 7.144 1.00 0.00 
-22.248 -30.464 6.602 1.00 0.00 
-21.752-29.350 6.025 1.00 0.00 
-20.446-29.201 5.899 1.00 0.00 
-22.565 -28.390 5.605 1.00 0.00 
-27.930-31.029 9.408 1.00 0.00 
-21.632 -31.222 6.850 1.00 0.00 
-20.086 -28.349 5.310 1.00 0.00 
-19.803-29.904 6.220 1.00 0.00 
-22.163-27.510 5.312 1.00 0.00 
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ATOM 356 2HH2ARGR 80 -23.555-28.546 5.585 1.00 0.00 

ATOM 357 N CYS R 81 -25.254-33.378 10.546 1.00 0 00 
ATOM 358 CA CYS R 81 -25.630-34.302 11.618 1.00 0 00 
ATOM 359 C CYS R 81 *4.657 -34.425 12.779 1.00 0 00 
5 ATOM 360 O CYS R 81 -23.437-34.482 12.654 1.00 0 00 
ATOM 361 CB CYS R 81 -25.971 -35.691 11.060 1.00 0 00 
ATOM 362 SG CYS R 81 -27.286-35.685 9.805 1.00 0.00 
ATOM 363 H CYS R 81 -24.383-33.437 10.057 1.00 0.00 
ATOM 364 N ALA R 82 -25.276-34.521 13.965 1.00 0 00 
10 ATOM 355 CA ALA R 82 -24.515-34.749 15.194 1.00 0 00 
ATOM 366 C ALA R 82 -25.304-35.664 16.121 1.00 0 00 
ATOM 357 O ALA R 82 -26.298-36.259 15.706 1.00 0 00 
ATOM 368 CB ALAR 82 -24.220-33.410 15.8B0 1.00 0.00 

,, rsL^n. oz -^/a -34.483 13.959 1.00 0 00 

15 ATOM 370 N TYR R 83 -24.873-35.700 17.401 1.00 0 00 
ATOM 371 CA TYR R 83 -25.613-36.307 18.519 1 00 0 00 
ATOM 372 C TYR R 83 -25.565-37.827 18.584 1.00 0 00 
ATOM 373 O TYR R 83 -24.974-38.412 19.487 1.00 0 00 
ATOM 374 CB TYR R 83 -27.078-35.829 18.634 1 00 0 00 
20 ATOM 375 CG TYR R 83 -27.226-34.328 18.781 1.00 0 00 
ATOM 376 CD1 TYR R 83 -26.962 -33.725 20.028 1 00 0 00 
ATOM 377 CD2TYRR 83 -27.670-33.574 17.674 1 00 0 00 
ATOM 378 CE1TYRR 83 -27.210-32.351 20.189 1 00 0 00 
ATOM 379 CE2TYRR 83 -27.914-32.198 17.834 1 00 0 00 
25 ATOM 380 CZ TYR R 83 -27.710-31.608 19.099 1 00 0 00 
ATOM 381 OH TYR R 83 -28.024-30.273 19.294 1 00 0 00 
ATOM 382 H TYR R 83 -23.940-35.374 17.555 1 00 0 00 
ATOM 383 HH TYR R 83 -28.217-29.854 18.456 1 00 0 00 
ATOM 384 N GLY R 84 -26.236-38.425 17.591 1 00 0 00 
30 ATOM 385 CA GLY R 84 -26.228-39.880 17.476 1 00 0 00 
ATOM 386 C GLY R 84 -25.429 -40.269 16.256 1 00 0 00 
ATOM 387 O GLY R 84 -24.379-40.900 16.321 1.00 0 00 
ATOM 388 H GLY R 84 -26.647-37.841 16.890 1.00 0 00 
ATOM 389 N TYR R 85 -25.951 -39.784 15.120 1 00 0 00 
35 ATOM 390 CA TYR R 85 -25.110-39.752 13.923 1.00 0 00 
ATOM 391 C TYR R 85 -24.142-38.590 13.997 1 00 0 00 
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ATOM 392 GTYRR 85 -24.434-37.463 13.597 1.00 0 00 
ATOM 393 CB TYR R 85 -25.941 -39.668 12.637 1 00 0 00 
ATOM 394 CG TYR R 85 -26.885-40.843 12.509 1 00 0 00 
ATOM 395 CD1TYRR 85 -26.367-42.140 12.304 1.00 0 00 
5 ATOM 396 CD2 TYR R 85 -28.271 -40.602 12.591 1 00 0 00 
ATOM 397 CE1TYRR 85 -27.258-43.221 12.185 1.00 0 00 
ATOM 398 CE2TYRR 85 -29.161 -41.681 12.467 1 00 0 00 
ATOM 399 CZ TYR R 85 -28.645-42.978 12.269 1.00 0 00 
ATOM 400 OH TYR R 85 -29.528-44.039 12.161 1 00 0 00 
10 ATOM 401 H TYR R 85 -26.801 -39.267 15.189 1 00 0 00 
ATOM 402 HH TYR R 85 -30.392-43.682 11.979 1 00 0 00 
ATOM 403 N TYR R 86 -23.001 -38.941 14.593 1 00 0 00 
ATOM 404 CA TYR R 86 -21.996-37.958 14.983 1.00 0.00 

ATOM 405 C TVRD Re ->« 

■ ' " ww IO-.9/.T// 13.824 1.00 0 00 

15 ATOM 406 O TYR R 86 -21.126-35.992 1 3.937 1 00 0 00 
ATOM 407 CB TYR R 86 -20.809-38.602 15.710 1 00 0 00 
ATOM 408 CG TYR R 86 -21.207-39.697 16.673 1.00 0 00 
ATOM 409 CD1 TYR R 86 -20.850-41.023 16.352 1 00 0 00 
ATOM 410 CD2TYRR 86 -21.890-39.374 17.863 1 00 0 00 
20 ATOM 411 CE1TYRR 86 -21.141 -42.050 17.264 1.00 0 00 
ATOM 412CE2TYRR 86 -22.184-40.403 18.774 1 00 0 00 
ATOM 413 CZ TYR R 86 -21.785-41.722 18.473 1 00 0 00 
ATOM 414 OH TYR R 86 -22.023-42.711 19.413 1 00 0 00 
ATOM 415 H TYR R 86 -22.958,39.890 14.897 1 00 0 00 
25 ATOM 416 HH TYR R 86 -21.495-43.480 19.212 1 00 0 00 
ATOM 417 N GLN--R 87 -21.218-37.946 12.732 1 00 0 00 
ATOM 418 CA GLN R 87 -20.570-37.470 11.510 1.00 0 00 
ATOM 419 C GLN R 87 -19.097-37.109 11.556 1 00 0 00 
ATOM 420 O GLN R 87 -18.514-36.712 12.563 1 00 0 00 
30 ATOM 421 CB GLN R 87 -21.334-36.316 10.855 1 00 0 00 
ATOM 422 CG GLN R 87 -22.628-36.720 10.170 1.00 0 00 
ATOM 423 CD GLN R 87 -22.807 -35.859 8.937 1 00 0 00 
ATOM 424 OE1 GLN R 87 -23.553 -34.880 8.908 1 00 0 00 
ATOM 425 NE2GLNR 87 -22.130-36.305 7.879 1.00 0 00 
35 ATOM 426 H GLN R 87 -21.584-38.879 12.734 1 00 0 00 
ATOM 427 1HE2GLNR 87 -22.399-35.962 6.979 1.00 0 00 



94 



>' <WO 9806048A2 l_> 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06048 

ATOM 428 2HKGLNR87 -2,43,-37.0,6 7.960 ,.00 0.00 

ATOM N ASP R 88 -,8.536-37.2,0 ,0.345 ,00 0.00 

ATOM CA ASP R 88 -,7.727-36.069 9.929 , 00 0 00 

ATOM C ASP R 88 -,8.63,-35.3,8 8.978 ,.00 0.00 

. 5 ATOM 432 O ASP R 88 -19.200-35.901 6.064 1.00 0 00 
"3CBASPR08 

ATOM 434 CG ASP R 88 -,5.384-35.376 9.43, 1.00 0 00 

ATOM 435 0D1ASPR 88 -14.9,5 -34.822 8.442 1.00 0 00 

,0 ATOM 4M " ^ • 15 0S1 - 35 03J ««• I-* 0.00 

* HZ SPRSB " 1M «-»"« 0666 ,00 0 00 

ATOM N GLU R 89 -,8.683-34.048 9.30, 1.00 0.00 

ATOM 439 CA GLU R 89 -20., 63 -33.427 8.949 ,.00 0 00 

ATOM 440 C GLU R 89 -20.666-33.353 7.500 ,.00 0 00 

15 atom " " LUK 89 7", 1.00 0.00 

15 ATOM 442 CB GLU R 89 -20.252-32.098 9.722 1.00 0 00 

ATOM 443 CG GLU R 89 -,9.934-30.765 9.024 , 00 0 00 

ATOM 444 CD GLU R 89 -,8.588-30.734 8.336 ,.00 0.00 

ATOM 445 OE, GLU R 89 -,8.494 -,,.092 7,74 ,.00 0 00 

20 ATo! GLUR89 -"■•"■"•W "40 ,00 0.00 

20 ATOM 447 H GLU R 89 -,8.289-33.640 9.986 ,00 0 00 

ATOM 448 N THRR.90 -,9.709-33.499 6.555 ,00 0.00 

ATOM 449 CA THR R 90 -,9.933-32.94, 5.2,7 ,00 0 00 

ATOM 450 C THR R 90 -2,285 -33„8 4.552 ,00 0.00 
25 aZ THRR9 ° ^ 036-34 , 60 4.600 ,00 0.00 

ato! " ™ R " 90 " ,e 8,0 - 33 347 «* i-» o.oo 

ATOM 453 OG,THRR 90 -,7.674-33.636 4.984 , 00 0 00 

ATOM 454 CG2THRR90 

ATOM H THR R 90 -13.764-33.646 

ATOM 456 HG, THR R 90 -,7.327-33.064 5 444 , 00 0 00 
30 ATOM 457 N THR R 9, -21,47-3,0,7 

ATOM 458 CA THR R 91 -22.860-3,897 3.097 , 00 0 00 
ATOM 459 C THR R 9, 0 
ATOM - 23 ' 258 -33.149 2.323 ,00 0.00 

ATOM 460 O THRR 9, -22.484-33,76 ,526 ,00 0 00 

35 ATOM CB 91 -30 ^0 2,4, ,00 0 00 

35 ATOM 462 OG, THR R 91 .2,888 . 29 . 793 ^ 

ATOM CG2 THR R 91 -23.964-30.007 ,827 ,00 0,0 
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ATOM 464 H THR R 91 -20.984-31.271 3.860 1 00 0 00 
ATOM 465 HG1THRR 91 -20.894-29.846 2.096 1 00 0 00 
ATOM 466 N GLY R 92 -24.499-33.584 2.577 1.00 0 00 
ATOM 467 CA GLY R 92 -24.978-34.710 1.781 1.00 0 00 
5 ATOM 468 C GLY R 92 -24.860-36.093 2.409 1.00 0 00 
ATOM 469 O GLY R 92 -25.335-37.085 1.850 1 00 0 00 
ATOM 470 H GLY R 92 -25.022-33.210 3.337 1 00 0 00 
ATOM 471 N ARGR 93 -24.236-36.138 3.595 1.00 0 00 
ATOM 472 CA ARGR 93 -24.173-37.427 4.280 1 00 0 00 
10 ATOM 473 C ARC R 93 -24.418-37.314 5.777 1.00 0 00 
ATOM 474 O ARGR 93 -24.453-36.210 6.315 1.00 0 00 
ATOM 475 CB ARGR 93 -22.867-38.152 3.925 1.00 0 00 
ATOM 476 CG ARGR 93 -21.604-37.313 4.120 1.00 0.00 
ATOM 477 CD ARG R 93 -20.324-38.083 3.779 1.00 0 00 
15 ATOM 478 NE ARC R 93 -19.135-37.298 4.115 1 00 0 00 
ATOM 479 CZ ARGR 93 -18.683-37.255 5.391 1 00 0 00 
ATOM 480 NH1 ARG R 93 -17.741 -36.386 5.728 1 00 0 00 
ATOM 481 NH2 ARGR 93 -19.200-38.034 6.329 1 00 0 00 
ATOM 482 H ARG R 93 -23.801 -35.328 3.987 1 00 0 00 
20 ATOM 483 HE ARGR 93 -18.726-36.705 3.421 1 00 0 00 
ATOM 484 1HH1 ARGR 93 -17.400-36.385 6.666 1 00 0 00 
ATOM 485 2HH1 ARG R 93 -17.378-35.697 5.095 1 00 0 00 
ATOM 486 1HH2ARGR 93 -18.999-37.871 7.294 1 00 0 00 
ATOM 487 2HH2 ARG R 93 -19.832-38.785 6.135 1 00 0 00 
25 ATOM 488 N CYS R 94 -24.653-38.506 6.371 1.00 0 00 
ATOM 489 CA CYS R 94 -24.768-38.726 7.818 1 00 0 00 
ATOM 490 C CYSR 94 -24.488-40.182 8.172 1.00 0 00 
ATOM 491 O CYSR 94 -25.041 -41.119 7.594 1.00 0 00 
ATOM 492 CB CYSR 94 -26.138-38.358 8.397 1 00 0 00 
30 ATOM 493 SG CYS R 94 -26.692 -36.659 8.083 1 00 0 00 
ATOM 494 H CYSR 94 -24.611 -39.336 5.816 1.00 0.00 
ATOM 495 N GLU R 95 -23.572-40.320 9.142 1.00 0.00 
ATOM 496 CA GLU R 95 -22.945-41.588 9.514 1.00 0.00 
ATOM 497 C GLU R 95 -22.224-41.383 10.836 1.00 0 00 
35 ATOM 498 O GLU R 95 -22.258-40.278 11.382 1.00 0.00 
ATOM 499 CB GLU R 95 -21.981 -42.075 8.412 1.00 0.00 
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ATOM 500 CG GLU R 95 -20.721 -41.229 8.154 1.00 0 00 
ATOM 501 CD GLU R 95 -21.085-39.811 7.761 1.00 0 00 
ATOM 502 OE1GLUR 95 -21.467-39.570 6.627 1.00 0 00 
ATOM 503 OE2GLUR 95 -20.997-38.922 8.591 1.00 0 00 
5 ATOM 504 H GLU R 95 -23.203-39.480 9.540 1.00 0 00 
ATOM 505 N ALAR 96 -21.599-42.462 11.332 1 00 0 00 
ATOM 506 CA ALAR 96 -20.851 -42.305 12.578 1.00 0 00 
ATOM 507 C ALAR 96. -19.527-41.574.12.413 1.00 0 00 
ATOM 508 O ALAR 96 -18.742-41.952 11.544 1.00 0 00 
10 ATOM 509 CB ALAR 96 -20.605-43.668 13.231 1 00 0 00 
ATOM 510 OXTALAR 96 -19.293-40.619 13.151 1.00 0 00 
ATOM 511 H ALAR 96 -21.711 -43.344 10.671 1.00 0.00 
END 
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1 5 REMARK 1 Second CRD of p75™, bound to BDNF 
REMARK 2 

ATOM 1 N CYS R 39 -26.73, -23.425 17.186 1.00 0 00 
ATOM 2 CA CYS R 39 -29.491 -22.2,5 ,7.484 1.00 0 00 
ATOM 3 C CYSR 39 -30.988-22.434 ,7.523 1.00 0.00 
20 ATOM 4 O CYSR 39 -31.773 -2, 495 ,7.660 , 00 0 00 
ATOM 5 CB CYS R 39 -29.203-21.102 , 6.474 , 00 0 00 
ATOM 6 SO CYSR 39 -27.696 -2,. 3,2 ,5.47, , 00 ado 
ATOM 7 1H CYSR 39 -27.735-23.264 17 ,48 ,.00 0 00 
ATOM 8 2H CYSR 39 -28.868-24 , 53 , 7.874 1.00 0 00 
25 ATOM 9 3H CYSR 39 -28.962-23.835 16.296 1.00 0 00 
ATOM N LEU R 40 -31.38,-23.704 , 7.336 1.00 0 00 

ATOM „ CA LEU R 40 -32.802-24.005 , 7.397 , 00 0 00 
ATOM 12 C LEU R 40 -33.098-24.788 , 8.662 1.00 0 00 
ATOM O LEUR 40 -32.371 -24.712 19.645 1.00 o'oO 
30 ATOM CB LEU R 40 -33.224 -24.736 ,6.114 1.00 0 00 
ATOM 15 CG LEU R 40 -33.14, -23.868 ,4.854 , 00 0 00 
ATOM ,6CD1LEUR 40 -33.498-24.664 13.599 1.00 0 00 
ATOM 17CO2LEUR 40 -33.987 -22.598 , 4.966 ,.00 0 00 
35 HZ 18HLEUR4 ° -^34-24.467 17.3,4 1.00 0.00 
35 ATOM 19 N ASP R 41 -34.194-25.545 ,8.604 ,.00 0 00 
ATOM 20 CA ASP R 41 -34.394-26.559 , 9.630 1.00 0 00 
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ATOM 21 C ASPR 41 -33.510-27.762 19.326 1.00 0.00 

ATOM 22 O ASPR 41 -32.569-27.678 18.542 1.00 0.00 

ATOM 23 CB ASPR 41 -35.888-26.923 19.676 1.00 0.00 

ATOM 24 CG ASPR 41 -36.282-27.631 18.391 1.00 0.00 

5 ATOM 25 OD1ASPR 41 -36.462-28.842 18.426 1.00 0.00 

ATOM 26 OD2ASPR 41 -36.364-26.975 17.359 1.00 0.00 

ATOM 27 H ASPR 41 -34.766-25.640 17.790 1.00 0.00 

ATOM 28 N SERR 42 -33.860-28.892 19.945 1.00 0.00 

ATOM 29 CA SERR 42 -33.192-30.156 19.648 1.00 0.00 

10 ATOM 30 C SERR 42 -32.956-30.499 18.173 1.00 0.00 

ATOM 31 O SERR 42 -31.967-31.144 17.816 1.00 0.00 

ATOM 32 CB SER R 42 -33.975-31.256 20.359 1.00 0.00 

ATOM 33 OG SERR 42 -34.507-30.738 21.597 1.00 0.00 

ATOM O A i_j o?"rti-* * ~ 

...w... „ ' «w -.54.638 -28.916 20.568 1.00 0 00 

15 ATOM 35 HG SER R 42 -35.328-31.194 21.757 1.00 0.00 

ATOM 36 N VALR 43 -33.884-30.042 17.310 1.00 0.00 

ATOM 37 CA VALR 43 -33.712-30.372 15.895 1.00 0.00 

ATOM 38 C VALR 43 -32.454-29.826 15.235 1.00 0.00 

ATOM 39 O VALR 43 -31.932^30.402 14.281 1.00 0.00 

20 ATOM 40 CB VALR 43 -34.955-30.058 15.044 1.00 0.00 

ATOM 41 CG1 VALR 43 -36.186-30.773 15.604 1.00 0.00 

ATOM 42 CG2VALR 43 -35.182-28.560 14.824 1.00 0.00 

ATOM 43 H VALR 43 -34.615-29.433 17.633 1.00 0.00 

ATOM 44 N THRR 44 -31.938-28.722 15.786 1.00 0.00 

25 ATOM 45 CA THR R 44 -30.651 -28.248 15.283 1.00 0.00 

ATOM 46 C THRR 44 -29.869-27.538 16.372 1.00 0.00 

ATOM 47 O THRR 44 -30.165-26.391 16.716 1.00 0.00 

ATOM 48 CB THRR 44 -30.833-27.333 14.062 1.00 0 00 

ATOM 49 OG1 THR R 44 -31.913-27.770 13.215 1.00 0.00 

30 ATOM 50 CG2THRR 44 -29.538-27.231 13.256 1.00 0.00 

ATOM 51 H THRR 44 -32.435-28.226 16.507 1.00 0.00 

ATOM 52 HG1 THRR 44 -31.927-28.725 13.271 1.00 0.00 

ATOM 53 N PHER 45 -28.911 -28.291 16.916 1.00 0.00 

ATOM 54 CA PHER 45 -28.186-27.770 18.069 1.00 0 00 

35 ATOM 55 C PHER 45 -26.993-26.926 17.656 1.00 0.00 

ATOM 56 O PHER 45 -26.806-26.646 16.478 1.00 0.00 
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ATOM 57 CB PHE R 45 
ATOM 58 CG PHE R 45 
ATOM 59 CD1 PHE R 45 
ATOM 60 CD2 PHE R 45 
5 ATOM 61 CE1 PHE R 45 
ATOM 62 CE2 PHE R 45 
ATOM 63 CZ PHE R 45 
ATOM 64 H PHE R 45 
ATOM 65 N SER R 46 
10 ATOM 66 CA SER R 46 
ATOM 67 C SER R 46 
ATOM 68 O SERR 46 
ATOM 69 CB SERR 46 
ATOM 70 OG SER R 46 
15 ATOM 71 H SER R 46 
ATOM 72 HG SER R 46 
ATOM 73 N ASP R 47 
ATOM 74 CA ASP R 47 
ATOM 75 C ASP R 47 
20 ATOM 76 O ASP R 47 
ATOM 77 CB ASP R 47 
ATOM 78 CG ASP R 47 
ATOM 79 OD1 ASP R 47 
ATOM 80 O02ASPR 47 
25 ATOM 81 H ASP R 47 
ATOM 82 N VALR 48 
ATOM 83 CA VAL R 48 
ATOM 84 C VALR 48 
ATOM 85 O VAL R 48 
30 ATOM 86 CB VALR 48 
ATOM 87 CG1 VAL R 48 
ATOM 88 CG2VALR 48 
ATOM 89 H VALR 48 
ATOM 90 N VAL R 49 
* 35 ATOM 91 CA VAL R 49 
ATOM 92 C VAL R 49 



-27.822-28.924 19.006 1.00 0.00 
-26.899-29.937 18.362 1.00 0.00 
-27.449-31.094 17.770 1.00 0.00 
-25.503-29.718 18.376 1.00 0.00 
-26.592-32.040 17.179 1.00 0.00 
-24.646-30.663 17.785 1.00 0.00 
-25.200-31.813 17.186 1.00 0.00 
-28.575-29.099 16.440 1.00 0.00 
-26.169-26.504 18.621 1.00 0.00 
-25.321 -25.406 18.156 1.00 0.00 
-23.805-25.596 18.062 1.00 0.00 
-23.139-25.015 17.205 1.00 0.00 
-25.722-24.118 18.887 1.00 0.00 
-27.050-24.244 19.441 1.00 0.00 
-26.281 -26.700 19.596 1.00 0.00 
-26.881 -24.590 20.319 1.00 0.00 
-23.284-26.424 18.982 1.00 0.00 
-21.847-26.704 19.149 1.00 0.00 
-20.914-25.513 19.394 1.00 0.00 
-19.707-25.533 19.159 1.00 0.00 
-21.313 -27.830 18.204 1.00 0.00 
-20.687-27.453 16.853 1.00 0.00 
-20.787-28.220 15.898 1.00 0.00 
-20.046-26.421 16.716 1.00 0.00 
-23.930-26.730 19.679 1.00 0.00 
-21.525-24.456 19.953 1.00 0.00 
-20.791 -23.190 20.089 1.00 0.00 
-19.752-23.030 21.212 1.00 0.00 
-19.796-22.163 22.083 1.00 0.00 
-21.762-22.000 20.041 1.00 0.00 
-22.331 -21.847 18.628 1.00 0.00 
-22.865-22.095 21.103 1.00 0.00 
-22.434-24.585 20.351 1.00 0.00 
-18.722-23.884 21.116 1.00 0.00 
-17.548-23.718 21.974 1.00 0.00 
■15.827-22.386 21.754 1.00 0.00 
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ATOM 93 O VALR 49 -16.199 - 21.808 22.647 1.00 0.00 
ATOM 94 CB VALR 49 -16.621 -24.942 21.815 1.00 0.00 
ATOM 95 CG1 VALR 49 -16.113-25.126 20.379 1.00 0.00 
ATOM 96 CG2VALR 49 -15.490-24.948 22.848 1.00 0.00 
5 ATOM 97 H VALR 49 -18.738-24.575 20.398 1.00 0.00 
ATOM 98 N SERR 50 -16.984-21.895 20.512 1.00 0.00 
ATOM 99 CA SERR 50 -16.545-20.542 20.170 1.00 0.00 
ATOM 100 C SERR 50 -17.478-20.015 19.099 1.00 0.00 
ATOM 101 O SERR 50 -18.142-20.822 18.450 1.00 0.00 
10 ATOM 102 CB SERR 50 -15.104-20.582 19.665 1.00 0.00 
ATOM 103 OG SERR 50 -14.388-21.603 20.381 1.00 0.00 
ATOM 104 H SERR 50 -17.472-22.447 19.837 1.00 0.00 
ATOM 105 HG SERR 50 -14.702-21.503 21.280 1.00 0.00 
ATOM 106 N ALA R 51 *17.535 -18.689 18.939 1.00 0.00 
15 ATOM 107 CA ALAR 51 -18.444-18.111 17.945 1.00 0.00 
ATOM 108 C ALAR 51 -18.009-18.327 16.502 1.00 0.00 
ATOM 109 O ALAR 51 -17.384-17.480 15.860 1.00 0.00 
ATOM 110 CB ALAR 51 -18.633-16.614 18.202 1.00 0.00 
ATOM 111 H ALAR 51 -16.914-18.119 19.470 1.00 0.00 
20 ATOM 112 N THRR 52 -18.362-19.542 16.034 1.00 0.00 
ATOM 113 CA THRR 52 -17.855-20.040 14.756 1.00 0.00 
ATOM 114 C THRR 52 -18.740-21.103 14.084 1.00 0.00 
ATOM 115 O THRR 52 -18.814-22.264 14.488 1.00 0.00 
ATOM 116 CB THRR 52 -16.422-20.581 14.935 1.00 0.00 
25 ATOM 117 OG1 THRR 52 -15.633-19.712 15.770 1.00 0.00 
ATOM 118 CG2THRR 52 -15.723-20.802 13.589 1.00 0.00 
ATOM 119 H THRR 52 -18.809-20.200 16.644 1.00 0.00 
ATOM 120 HG1 THRR 52 -15.985-18.832 15.634 1.00 0.00 
ATOM 121 N GLUR 53 -19.349-20.674 12.968 1.00 0.00 
30 ATOM 122 CA GLUR 53 -20.398-19.674 13.115 1.00 0.00 
ATOM 123 C GLUR 53 -21.732-20.164 13.692 1.00 0.00 
ATOM 124 O GLUR 53 -22.195-19.586 14.668 1.00 0.00 
ATOM 125 CB GLUR 53 -20.639-18.835 11.840 1.00 0.00 
ATOM 126 CG GLUR 53 -19.550-18.814 10.755 1.00 0.00 
35 ATOM 127 CD GLUR S3 -19.530 -20.110 9.955 1.00 0.00 
ATOM 128 OE1 GLU R 53 -20.581 -20.583 9.522 1.00 0.00 
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ATOM 129 OE2GLUR 53 -18.455-20.666 9.758 1.00 0.00 
ATOM 130 H GLU R 53 -19.314-21.243 12.150 1.00 0.00 
ATOM 131 N PROR 54 -22.382-21.186 13.062 1.00 0.00 
ATOM 132 CA PROR 54 -23.801 -21.374 13.363 1.00 0.00 
5 ATOM 133 C PRO R 54 -24.087 -22.600 14.224 1.00 0.00 
ATOM 134 O PROR 54 -23.206-23.245 14.790 1.00 0.00 
ATOM 135 CB PROR 54 -24.322-21.543 11.933 1.00 0.00 
ATOM 136 CG PROR 54 -23.279-22.449 11.267 1.00 0.00 
ATOM 137 CD PROR 54 -21.973-22.056 11.958 1.00 0.00 
10 ATOM 138 N CYS R 55 -25.389-22.945 14.169 1.00 0.00 
ATOM 139 CA CYS R 55 -25.953-24.243 14.552 1.00 0.00 
ATOM 140 C CYSR 55 -25.269-25.446 13.903 1.00 0.00 
ATOM 141 O CYSR 55 -24.242-25 300 13.245 1.00 0.00 
ATOM 142 CB CYSR 55 -27.451 -24.242 14.222 1.00 0.00 
15 ATOM 143 SG CYSR 55 -28.150-22.588 13.922 1.00 0.00 
ATOM 144 H CYSR 55 -26.029-22.260 13.834 1.00 0.00 
ATOM 145 N LYSR 56 -25.873-26.628 14.098 1.00 0.00 
ATOM 146 CA LYSR 56 -25.241 -27.917 13.821 1.00 0.00 
ATOM 147 C LYSR 56 -26.256-29.045 13.957 1.00 0.00 
20 ATOM 148 O LYSR 56 -26.703-29.417 15.037 1.00 0.00 
ATOM 149 CB LYSR 56 -24.073-28.150 14.789 1.00 0.00 
ATOM 150 CG LYSR 56 -23.350-29.501 14.702 1.00 0.00 
ATOM 151 CD LYSR 56 -22.658-29.801 13.373 1.00 0.00 
ATOM 152 CE LYS R 56 -21.673 -30.966 13.503 1.00 0.00 
25 ATOM 153 NZ LYSR 56 -20.454-30.524 14.191 1.00 0.00 
ATOM 154 H LYSR 56 -26.725-26.673 14.623 1.00 0.00 
ATOM 155 1HZ LYSR 56 -19.887-31.370 14.402 1.00 0.00 
ATOM 156 2HZ LYSR 56 -20.632-30.000 15.073 1.00 0.00 
ATOM 157 3 HZ LYS R 56 -19.886 -29.946 13.544 1.00 0.00 
30 ATOM 158 N PROR 57 -26.633-29.599 12.787 1.00 0.00 
ATOM 159 CA PRO R 57 -27.416-30.837 12.811 1.00 0.00 
ATOM 160 C PROR 57 -26.644-31.966 13.476 1.00 0.00 
ATOM 161 O PROR 57 -25.420-31.911 13.609 1.00 0.00 
ATOM 162 CB PRO R 57 -27.665-31.109 11.323 1.00 0.00 
35 ATOM 163 CG PROR 57 -27.447-29.776 10.607 1.00 0.00 
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ATOM 164 CD PRO R 57 -26.356 -29.119 11.441 1.00 0.00 

ATOM 165 N CYSR 58 -27.418-32.994 13.860 1.00 0.00 

ATOM 166 CA CYSR 58 -26.761 -34.217 14.319 1.00 0.00 

ATOM 167 C CYSR 58 -26.309-35.070 13.141 1.00 0.00 

5 ATOM 168 O CYSR 58 -26.576-34.743 11.987 1.00 0.00 

ATOM 169 CB CYSR 58 -27.709-34.963 15.259 1.00 0.00 

ATOM 170 SG CYSR 58 -27.015-36.478 15.973 1.00 0.00 

ATOM 171 H CYSR 58 -28.408-32.886 13.776 1.00 0.00 

ATOM 172 N THRR 59 -25.623-36.173 13.459 1.00 0.00 

10 ATOM 173 CA THRR 59 -25.066-37.074 12.445 1.00 0.00 

ATOM 174 C THRR 59 -26.048-38.160 11.949 1.00 0.00 

ATOM 175 O THRR 59 -27.196-38.234 12.397 1.00 0.00 

ATOM 176 CB THRR 59 -23.767-37.636 13.055 1.00 0.00 

ATOM 177 OG1 THR R 59 -23.270-36.723 14.052 1.00 0.00 

15 ATOM 178 CG2THRR 59 -22.667-37.924 12.029 1.00 0.00 

ATOM 179 H THRR 59 -25.398-36.368 14.415 1.00 0.00 

ATOM 180 HG1THRR 59 -22.729-37.252 14.644 1.00 0.00 

ATOM 181 N GLUR 60 -25.578-38.985 10.999 1.00 0.00 

ATOM 182 CA GLUR 60 -26.467-40.006 10.445 1.00 0.00 

20 ATOM 183 C GLUR 60 -26.899-41.149 11.355 1.00 0.00 

ATOM 184 O GLUR 60 -26.152-42.083 11.672 1.00 0.00 

ATOM 185 CB GLUR 60 -25.943-40.541 9.111 1.00 0.00 

ATOM 186 CG GLUR 60 -27.011 -40.494 8.010 1.00 0.00 

ATOM 187 CD GLUR 60 -28.263-41.228 8.450 1.00 0.00 

25 ATOM 188 OE1 GLU R 60 -28.349-42.428 8.253 1.00 0.00 

ATOM 189 OE2 GLU R 60 -29.161 -40.605 9.003 1.00 0.00 

ATOM 190 H GLUR 60 -24.642-38.859 10.693 1.00 0.00 

ATOM 191 N CYSR 61 -28.169-41.001 11.750 1.00 0.00 

ATOM 192 CA CYSR 61 -28.775-41.881 12.742 1.00 0.00 

30 ATOM 193 C CYSR 61 -29.712-42.937 12.185 1.00 0.00 

ATOM 194 O CYSR 61 -30.148-43.845 12.890 1.00 0.00 

ATOM 195 CB CYSR 61 -29.564-41.086 13.782 1.00 0.00 

ATOM 196 SG CYSR 61 -28.791 -39.568 14.416 1.00 0.00 

ATOM 197 H CYSR 61 -28.690-40.307 11.255 1.00 0.00 

35 ATOM 198 N VAL R 62 -30.069-42.806 10.908 1.00 0.00 

ATOM 199 CA VAL R 62 -31.084-43.768 10.496 1.00 0.00 
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ATOM 


200 


C VAL R 62 




ATOM 


201 


O VAL R 62 




ATOM 


202 


CB VAL R 62 




ATOM 


203 


CG1 VAL R 62 


5 


ATOM 


204 


CG2 VAL R 62 




ATOM 


205 


H VAL R 62 




ATOM 


206 


N GLY R 63 




ATOM 


207 


CA GLY R 63 




ATOM 


208 


C GLY R 63 


10 


ATOM 


209 


O GLY R 63 




ATOM 


210 


H GLY R 63 




ATOM 


211 


N LEU R 64 




ATOM 


212 


CA LEU R 64 




ATOM 


213 


C LEU R 64 


15 


ATOM 


214 


O LEU R 64 




ATOM 


215 


CB LEU R 64 




ATOM 


216 


CG LEU R 64 




ATOM 


217 


CD1 LEU R 64 




ATOM 


218 


CD2 LEU R 64 


20 


ATOM 


219 


H LEU R 64 




ATOM 


220 


N GLN R 65 




ATOM 


221 


CA GLN R 65 




ATOM 


222 


C GLN R 65 




ATOM 


223 


O GLN R 65 


25 


ATOM 


224 


CB GLN R 65 




ATOM 


225 


CG GLN R 65 




ATOM 


226 


CD GLN R 65 




ATOM 


227 


OE1 GLN R 65 




ATOM 


228 


NE2 GLN R 65 


30 


ATOM 


229 


H GLN R 65 




ATOM 


230 1HE2GLN R 65 




ATOM 


231 2HE2 GLN R 65 




ATOM 


232 


N SER R 66 




ATOM 


233 


CA SER R 66 


35 


ATOM 


234 


C SER R 66 




ATOM 


235 


O SER R 66 



-30.571 -44.884 9.598 1.00 0.00 
-30.480-46.055 9.964 1.00 0.00 
-32.301 -43.050 9.878 1.00 0.00 
-33.489-44.005 9.713 1.00 0.00 
-32.705-41.812 10.685 1.00 0.00 
-29.584-42.202 10.268 1.00 0.00 
-30.336-44.466 8.349 1.00 0.00 
-30.443-45.442 7.270 1.00 0.00 
-29.148-45.809 6.584 1.00 0.00 
-29.004-46.882 6.009 1.00 0.00 
-30.082-43.508 8.197 1.00 0.00 
-28.206-44.867 6.628 1.00 0.00 
-26.859-45.283 6.260 1.00 0.00 
-26.145-45.795 7.489 1.00 0.00 
-25.525-46.855 7.545 1.00 0.00 
-26.088-44.113 5.644 1.00 0.00 
-26.754-43.525 4.397 1.00 0.00 
-26.073-42.229 3.958 1.00 0.00 
-26.848-44.540 3.255 1.00 0.00 
-28.391 -43.992 7.076 1.00 0.00 
-26.262-44.951 8.515 1.00 0.00 
-25.639-45.245 9.796 1.00 0.00 
-26.708-45.080 10.851 1.00 0.00 
-27.738-44.460 10.612 1.00 0.00 
-24.456-44.300 10.006 1.00 0.00 
-23.336-44.444 8.965 1.00 0.00 
-22.421 -45.624 9.253 1.00 0.00 
-21.315-45.443 9.762 1.00 0.00 
-22.857-46.816 8.833 1.00 0.00 
-26.850-44.141 8.449 1.00 0.00 
-22.292-47.635 8.846 1.00 0.00 
-23.800-46.898 8.500 1.00 0.00 
-26.473-45.715 12.001 1.00 0.00 
-27.657-45.886 12.838 1.00 0.00 
-27.604-45.224 14.204 1.00 0.00 
-26.542-44.815 14.682 1.00 0.00 
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ATOM 236 CB SERR 66 -28.013-47.373 12.884 1.00 0.00 

ATOM 237 OG SER R 66 -27.869-47.927 11.565 1.00 0.00 

ATOM 238 H SER R 66 -25.586-46.141 12.173 1.00 0.00 

ATOM 239 HG SER R 66 -28.474-47.423 11.013 1.00 0.00 

5 ATOM 240 N MET R 67 -28.808-45.110 14.779 1.00 0.00 

ATOM 241 CA MET R 67 -29.048-44.257 15.939 1.00 0.00 

ATOM 242 C MET R 67 -28.294-44.575 17.212 1.00 0.00 

ATOM 243 O MET R 67 -28.404-45.649 17.800 1.00 0.00 

ATOM 244 CB MET R 67 -30.556-44.203 16.197 1.00 0.00 

10 ATOM 245 CG MET R 67 -31.008-42.933 16.916 1.00 0.00 

ATOM 246 SD MET R 67 -32.793-42.833 17.123 1.00 0.00 

ATOM 247 CE MET R 67 -33.244-42.903 15.380 1.00 0.00 

ATOM 248 H MET R 67 -29.573-45.360 14.189 1.00 0.00 

ATOM 249 N SER R 68 -27.557-43.553 17.633 1.00 0.00 

15 ATOM 250 CA SER R 68 -26.992-43.620 18.977 1.00 0.00 

ATOM 251 C SERR 68 -27.868-42.986 20.035 1.00 0.00 

ATOM 252 O SERR 68 -27.813-43.326 21.215 1.00 0.00 

ATOM 253 CB SERR 68 -25.615-42.989 18.978 1.00 0.00 

ATOM 254 OG SERR 68 -25.029-43.228 17.693 1.00 0.00 

20 ATOM 255 H SERR 68 -27.423-42.756 17.044 1.00 0.00 

ATOM 256 HG SERR 68 -24.083-43.177 17.834 1.00 0.00 

ATOM 257 N ALA R 69 -28.706-42.060 19.521 1.00 0.00 

ATOM 258 CA ALAR 69 -29.837-41.437 20.214 1.00 0.00 

ATOM 259 C ALAR 69 -30.311 -40.307 19.312 1.00 0.00 

25 ATOM 260 O ALAR 69 -29.592-39.924 18.387 1.00 0.00 

ATOM 261 CB ALAR 69 -29.406-40.835 21.565 1.00 0.00 

ATOM 262 H ALAR 69 -28.493-41.701 18.610 1.00 0.00 

ATOM 263 N PRO R 70 -31.506-39.724 19.607 1.00 0.00 

ATOM 264 CA PRO R 70 -31.722-38.322 19.207 1.00 0.00 

30 ATOM 265 C PRO R 70 -30.580-37.436 19.699 1.00 0.00 

ATOM 266 O PRO R 70 -29.722-37.895 20.456 1.00 0.00 

ATOM 267 CB PRO R 70 -33.063-37.990 19.869 1.00 0.00 

ATOM 268 CG PRO R 70 -33.779-39.331 20.022 1.00 0.00 

ATOM 269 CD PRO R 70 -32.637-40.289 20.334 1.00 0.00 

35 ATOM 270 N CYS R 71 -30.549-36.191 19.227 1.00 0.00 

ATOM 271 CA CYS R 71 -29.381 -35.417 19.642 1.00 0.00 
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ATOM 


272 


C CYS R 71 




ATOM 


273 


O CYSR 71 




ATOM 


274 


CB CYS R 71 




ATOM 


275 


SG CYS R 71 


5 


ATOM 


276 


H CYS R 71 




ATOM 


277 


N VAL R 72 




ATOM 


278 


CA VAL R 72 




ATOM 


279 


C VAL R 72 




ATOM 


280 


O VAL R 72 


10 


ATOM 


281 


CB VAL R 72 




ATOM 


282 


CG1 VAL R 72 




ATOM 


283 


CG2 VALR 72 




ATOM 


284 


H VAL R 72 




ATOM 


285 


N GLU R 73 


15 


ATOM 


286 


CA GLU R 73 




ATOM 


287 


C GLU R 73 




ATOM 


288 


O GLU R 73 




ATOM 


289 


CB GLU R 73 




ATOM 


290 


CG GLU R 73 


20 


ATOM 


291 


CD GLU R 73 




ATOM 


292 


OE1 GLU R 73 




ATOM 


293 


OE2 GLU R 73 




ATOM 


294 


H GLU R 73 




ATOM 


295 


N ALA R 74 


25 


ATOM 


296 


CA ALA R 74 




ATOM 


297 


C ALA R 74 




ATOM 


298 


O ALA R 74 




ATOM 


299 


CB ALAR 74 




ATOM 


300 


H ALA R 74 


30 


ATOM 


301 


N ASP R 75 




ATOM 


302 


CA ASP R 75 




ATOM 


303 


C ASP R 75 




ATOM 


304 


O ASP R 75 




ATOM 


305 


CB ASP R 75 


35 


ATOM 


306 


CG ASP R 75 




ATOM 


307 


OD1 ASP R 75 



-29.730-34.124 20.337 1.00 0.00 
-30.848-33.618 20.262 1.00 0.00 
-28.418-35.200 18.474 1.00 0.00 
-27.801 -36.785 17.835 1.00 0.00 
-31.299-35.781 18.711 1.00 0.00 
-28.733-33.641 21.079 1.00 0.00 
-28.793-32.337 21.739 1.00 0.00 
-27.441 -31.695 21.515 1.00 0.00 
-26.587-32.323 20.897 1.00 0.00 
-29.087-32.477 23.244 1.00 0.00 
-30.554-32.827 23.502 1.00 0.00 
-28.125-33.455 23.932 1.00 0.00 
-27.860-34.132 21.089 1.00 0.00 
-27.282-30.465 22.038 1.00 0.00 
-25.992-29.787 21.908 1.00 0.00 
-24.848-30.522 22.569 1.00 0.00 
-24.610-30.447 23.773 1.00 0.00 
-26.073-28.328 22.365 1.00 0.00 
-24.825-27.556 21.923 1.00 0.00 
-24.974-26.067 22.159 1.00 0.00 
-25.947-25.478 21.698 1.00 0.00 
-24.103-25.484 22.793 1.00 0.00 
-28.030-30.065 22.564 1.00 0.00 
-24.183-31.276 21.690 1.00 0.00 
-22.982-32.030 22.006 1.00 0.00 
-22.211 -32.099 20.717 1.00 0.00 
-22.796-32.213 19.640 1.00 0.00 
-23.328-33.451 22.455 1.00 0.00 
-24.600-31.329 20.777 1.00 0.00 
-20.889-31.945 20.853 1.00 0.00 
-20.105-31.837 19.633 1.00 0.00 
-20.167-33.101 18.799 1.00 0.00 
-20.254-34.213 19.326 1.00 0.00 
-18.876 -31.413 19.976 1.00 0.00 
-18.194-30.327 19.038 1.00 0.00 
-17.534-29.398 19.470 1.00 0.00 
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ATOM 


308 


OD2 ASP R 75 


-18.410-30.388 17.841 1.00 0.00 




ATOM 


309 


H ASP R 75 


-20.452-31.968 21.747 1.00 0.00 




ATOM 


310 


N ASP R 76 


-20.194-32.858 17.466 1.00 0.00 




ATOM 


311 


CA ASP R 76 


-20.556-33.842 16.433 1.00 0.00 


5 


ATOM 


312 


C ASP R 76 


-21.294-35.063 16.965 1.00 0.00 




ATOM 


313 


O ASP R 76 


-20.725-36.140 17.153 1.00 0.00 




ATOM 


314 


CB ASP R 76 


-19.324-34.222 15.591 1.00 0.00 




ATOM 


315 


CG ASP R 76 


-18.850-33.033 14.765 1.00 0.00 




ATOM 


316 


OD1 ASP R 76 


-18.213-32.134 15.285 1.00 0.00 


10 


ATOM 


317 


OD2 ASP R 76 


-19.132-32.953 13.580 1.00 0.00 




ATOM 


318 


H ASPR 76 


-19.875-31.943 17.215 1.00 0.00 




ATOM 


319 


N ALAR 77 


-22.571 -34.806 17.303 1.00 0.00 




ATOM 


320 


CA ALAR 77 


-23.290-35.694 18.218 1.00 0.00 




ATOM 


321 


C ALA R 77 


-23.531 -37.073 17.651 1.00 0.00 


15 


ATOM 


322 


O ALA R 77 


-23.725-37.259 16.456 1.00 0.00 




ATOM 


323 


CB ALA R 77 


-24.619-35.075 18.644 1.00 0.00 




ATOM 


324 


H ALAR 77 


-23.001 -34.012 16.881 1.00 0.00 




ATOM 


325 


N VALR 78 


-23.455-38.052 18.556 1.00 0.00 




ATOM 


326 


CA VALR 78 


-22.704-39.281 18.232 1.00 0.00 


20 


ATOM 


327 


C VALR 78 


-23.295-40.388 17.344 1.00 0.00 




ATOM 


328 


O VALR 78 


-23.041 -41.583 17.543 1.00 0.00 




ATOM 


329 


CB VAL R 78 


-22.160-39.877 19.542 1.00 0.00 




ATOM 


330 


CG1 VAL R 78 


-21.189-38.911 20.224 1.00 0.00 




ATOM 


331 


CG2VALR 78 


-23.287-40.288 20.497 1.00 0.00 


25 


ATOM 


332 


H VALR 78 


-23.598-37.743 19.493 1.00 0.00 




ATOM 


333 


N CYSR 79 


-24.111 -40.011 16.349 1.00 0.00 




ATOM 


334 


CA CYSR 79 


-24.667-41.058 15.490 1.00 0.00 




ATOM 


335 


C CYS R 79 


-23.639-41.706 14.569 1.00 0.00 




ATOM 


336 


O CYS R 79 


-23.161 -41.097 13.611 1.00 0.00 


30 


ATOM 


337 


CB CYS R 79 


-25.854-40.535 14.695 1.00 0.00 




ATOM 


338 


SG CYSR 79 


-27.265-40.010 15.706 1.00 0.00 




ATOM 


339 


H CYS R 79 


-24.146-39.049 16.105 1.00 0.00 




ATOM 


340 


N ARG R 80 


-23.326-42.955 14.966 1.00 0.00 




ATOM 


341 


CA ARG R 80 


-22.220-43.688 14.354 1.00 0.00 


35 


ATOM 


342 


C ARG R 80 


-22.617-45.027 13.739 1.00 0.00 




ATOM 


343 


O ARG R 80 


-22.898-45.120 12.547 1.00 0.00 
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ATOM 


344 


CB ARG R 80 


-21.047-43.866 15.333 1.00 0.00 




ATOM 


345 


CG ARG R 80 


-19.958-42.787 15.285 1.00 0.00 




ATOM 


346 


CD ARG R 80 


-19.283-42.600 13.916 1.00 0.00 




ATOM 


347 


NE ARG R 80 


-18.739-43.844 13.361 1.00 0.00 


5 


ATOM 


348 


CZ ARG R 80 


-19.343-44.423 12.298 1.00 0.00 




ATOM 


349 


NH1 ARG R 80 


-18.915-45.588 11.814 1.00 0.00 




ATOM 


350 


NH2 ARG R 80 


-20.397-43.844 11.740 1.00 0.00 




ATOM 


351 


H ARG R 80 


-23.849-43.366 15.715 1.00 0.00 




ATOM 


352 


HE ARG R 80 


-17.952-44.277 13.804 1.00 0.00 


10 


ATOM 


353 1HH1 ARG R 80 


-19.409-46.064 11.082 1.00 0.00 




ATOM 


354 2HH1 ARG R 80 


-18.080-46.022 12.174 1.00 0.00 




ATOM 


355 1 HH2 ARG R 80 


-20.880-44.269 10.976 1.00 0.00 




ATOM 


356 2HH2 ARG R 80 


-20.749-42.984 12.102 1.00 0.00 




ATOM 


357 


N CYS R 81 


-22.630-46.086 14.566 1.00 0.00 


15 


ATOM 


358 


CA CYSR 81 


-22.946-47.400 14.002 1.00 0.00 




ATOM 


359 


C CYSR 81 


-23.527-48.331 15.049 1.00 0.00 




ATOM 


360 


O CYSR 81 


-22.830-49.080 15.728 1.00 0.00 




ATOM 


361 


CB CYSR 81 


-21.732-48.025 13.284 1.00 0.00 




ATOM 


362 


SG CYS R 81 


-22.046-48.494 11.547 1.00 0.00 


20 


ATOM 


363 


H CYS R 81 


-22.342-45.997 15.517 1.00 0.00 




ATOM 


364 


N ALAR 82 


-24.869-48.212 15.160 1.00 0.00 




ATOM 


365 


CA ALAR 82 


-25.712-49.047 16.033 1.00 0.00 




ATOM 


366 


C ALAR 82 


-25.041 -49.677 17.246 1.00 0.00 




ATOM 


367 


O ALAR 82 


-24.801 -50.881 17.310 1.00 0.00 


25 


ATOM 


368 


CB ALAR 82 


-26.390-50.148 15.212 1.00 0.00 




ATOM 


369 


H ALAR 82 


-25.322-47.577 14.537 1.00 0.00 




ATOM 


370 


N TYRR 83 


-24.697-48.786 18.189 1.00 0.00 




ATOM 


371 


CA TYR R 83 


-23.809-49.240 19.256 1.00 0.00 




ATOM 


372 


C TYRR 83 


-24.319-50.384 20.116 1.00 0.00 


30 


ATOM 


373 


O TYRR 83 


-25.492 -50.522 20.451 1.00 0.00 




ATOM 


374 


CB TYR R 83 


-23.268-48.059 20.087 1.00 0.00 




ATOM 


375 


CG TYR R 83 


-24.247-47.511 21.108 1.00 0.00 




ATOM 


376 


CD1 TYR R 83 


-25.207-46.556 20.715 1.00 0.00 




ATOM 


377 


CD2TYRR 83 


-24.147-47.951 22.445 1.00 0.00 


35 


ATOM 


378 


CE1 TYR R 83 


-26.058-46.003 21.688 1.00 0.00 




ATOM 


379 


CE2TYRR 83 


-25.001 -47.400 23.417 1.00 0.00 
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ATOM 380 CZ TYR R 83 -25.937-46.419 23.031 1.00 0.00 
ATOM 381 OH TYR R 83 -26.753-45.849 23.995 1.00 0.00 

ATOM 382 H TYR R 83 -25.061 -47.858 18.145 1.00 0.00 
ATOM 383 HH TYR R 83 -27.139-45.055 23.619 1.00 0.00 

5 ATOM 384 N GLY R 84 -23.359-51.236 20.478 1.00 0.00 

ATOM 385 CA GLY R 84 -23.783-52.411 21.232 1.00 0.00 

ATOM 386 C GLY R 84 -23.284-53.694 20.609 1.00 0.00 

ATOM 387 O GLY R 84 -23.189-54.732 21.259 1.00 0.00 

ATOM 388 H GLY R 84 -22.420-51.099 20.167 1.00 0.00 

10 ATOM 389 N TYR R 85 -22.925-53.551 19.317 1.00 0.00 

ATOM 390 CA TYR R 85 -21.764-54.346 18.941 1.00 0.00 

ATOM 391 C TYR R 85 -20.520-53.484 19.090 1.00 0.00 

ATOM 392 O TYR R 85 -20.632-52.266 19.248 1.00 0.00 

ATOM 393 CB TYR R 85 -21.930-54.959 17.539 1.00 0.00 

15 ATOM 394 CG TYR R 85 -21.890-53.946 16.413 1.00 0.00 

ATOM 395 CD1TYRR 85 -23.084-53.333 15.975 1.00 0.00 

ATOM 396 CD2TYRR 85 -20.649-53.671 15.805 1.00 0.00 

ATOM 397 CE1TYRR 85 -23.042-52.477 14.860 1.00 0.00 

ATOM 398 CE2TYRR 85 -20.609-52.814 14.697 1.00 0.00 

20 ATOM 399 CZ TYR R 85 -21.808-52.257 14.215 1.00 0.00 

ATOM 400 OH TYR R 85 -21.760-51.493 13.062 1.00 0.00 

ATOM 401 H TYR R 85 -23.172-52.734 18.790 1.00 0.00 

ATOM 402 HH TYR R 85 -20.964-51.753 12.596 1.00 0.00 

ATOM 403 N TYR R 86 -19.354-54.143 19.022 1.00 0.00 

25 ATOM 404 CA TYR R 86 -18.070-53.498 19.330 1.00 0.00 

ATOM 405 C TYR R 86 -17.547-52.466 1 8.319 1.00 0.00 

ATOM 406 O TYR R 86 -16.457-52.635 17.784 1.00 0.00 

ATOM 407 CB TYR R 86 -17.072-54.652 19.552 1.00 0.00 

ATOM 408 CG TYR R 86 -15.786-54.368 20.316 1.00 0.00 

30 ATOM 409 CD1TYRR 86 -15.322-53.061 20.597 1.00 0.00 

ATOM 410CD2TYRR 86 -15.053-55.502 20.719 1.00 0.00 

ATOM 411 CE1TYRR 86 -14.087-52.896 21.255 1.00 0.00 

ATOM 412CE2TYRR 86 -13.825-55.340 21.380 1.00 0.00 

ATOM 413 CZ TYR R 86 -13.346-54.040 21.627 1.00 0.00 

35 ATOM 414 OH TYR R 86 -12.110-53.911 22.244 1.00 0.00 

ATOM 415 H TYR R 85 -19.425-55.102 18.757 1.00 0.00 
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ATOM 416 HH TYR R 86 

ATOM 417 N GLN R 87 

ATOM 418 CA GLN R 87 

ATOM 419 C GLN R 87 

5 ATOM 420 O GLN R 87 

ATOM 421 CB GLN R 87 

ATOM 422 CG GLN R 87 

ATOM 423 CD GLN R 87 

ATOM 424 OE1 GLN R 87 

10 ATOM 425 NE2 GLN R 87 

ATOM 426 H GLN R 87 

ATOM 427 1HE2 GLN R 87 

ATOM 428 2HE2 GLN R 87 

ATOM 429 N ASP R 88 

15 ATOM 430 CA ASP R 88 

ATOM 431 C ASP R 88 

ATOM 432 O ASP R 88 

ATOM 433 CB ASP R 88 

ATOM 434 CG ASP R 88 

20 ATOM 435 OD1 ASP R 88 

ATOM 436 OD2ASPR 88 

ATOM 437 H ASP R 88 

ATOM 438 N GLU R 89 

ATOM 439 CA GLU R 89 

25 ATOM 440 C GLU R 89 

ATOM 441 O GLU R 89 

ATOM 442 CB GLU R 89 

ATOM 443 CG GLU R 89 

ATOM 444 CD GLU R 89 

30 ATOM 445 OE1 GLU R 89 

ATOM 446 OE2GLUR 89 

ATOM 447 H GLU R 89 

ATOM 448 N THR R 90 

ATOM 449 CA THR R 90 

35 ATOM 450 C THR R 90 

ATOM 451 O THR R 90 



-11.829-53.003 22.149 1.00 0.00 
-18.334-51.399 18.081 1.00 0.00 
-17.800-50.310 17.258 1.00 0.00 
-17.117-49.255 18.112 1.00 0.00 
-17.704-48.681 19.022 1.00 0.00 
-18.910-49.706 16.371 1.00 0.00 
-18.725-48.273 15.812 1.00 0.00 
-17.729-48.089 14.663 1.00 0.00 
-17.701-47.027 14.040 1.00 0.00 
-16.936-49.120 14.374 1.00 0.00 
-19.143-51.256 18.647 1.00 0.00 
-16.239-49.104 13.659 1.00 0.00 
-16.869-50.003 14.846 1.00 0.00 
-15.845-49.028 17.759 1.00 0.00 
-15.041 -48.001 18.429 1.00 0.00 
-14.442-47.101 17.361 1.00 0.00 
-14.123-47.592 16.276 1.00 0.00 
-13.968^*8.725 19.273 1.00 0.00 
-12.942-47.819 19.953 1.00 0.00 
-11.758-48.104 19.876 1.00 0.00 
-13.294-46.831 20.580 1.00 0.00 
-15.429-49.592 17.039 1.00 0.00 
-14.306-45.805 17.710 1.00 0.00 
-13.680-44.795 16.839 1.00 0.00 
-14.515-44.403 15.614 1.00 0.00 
-15.309-43.460 15.616 1.00 0.00 
-12.257-45.283 16.493 1.00 0.00 
-11.293-44.524 15.571 1.00 0.00 
-10.219-45.518 15.145 1.00 0.00 
-9.962-45.681 13.957 1.00 0.00 
-9.644-46.177 15.998 1.00 0.00 
-14.511 -45.638 18.676 1.00 0.00 
-14.293-45.184 14.553 1.00 0.00 
-14.096-44.591 13.232 1.00 0.00 
-15.313-44.151 12.444 1.00 0.00 
-16.239-44.933 12.212 1.00 0.00 
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ATOM 


452 


CB THR R 90 




ATOM 


453 


OG1 THR R 90 




•ATOM 


454 


CG2 THR R 90 




ATOM 


455 


H THR R 90 


5 


ATOM 


456 


HG1 THR R 90 




ATOM 


457 


N THR R 91 




ATOM 


458 


CA THR R 91 




ATOM 


459 


C THR R 91 




ATOM 


460 


O THR R 91 


10 


ATOM 


461 


CB THR R 91 




ATOM 


462 


OG1 THR R 91 




ATOM 


463 


CG2 THR R 91 




ATOM 


464 


H THR R 91 




ATOM 


465 


HG1 THR R 91 


15 


ATOM 


466 


N GLY R 92 




ATOM 


467 


CA GLY R 92 




ATOM 


468 


C GLY R 92 




ATOM 


469 


O GLY R 92 




ATOM 


470 


H GLY R 92 


20 


ATOM 


471 


N ARG R 93 




ATOM 


472 


CA ARG R 93 




ATOM 


473 


C ARG R 93 




ATOM 


474 


O ARG R 93 




ATOM 


475 


CB ARG R 93 


25 


ATOM 


476 


CG ARG R 93 




ATOM 


477 


CD ARG R 93 




ATOM 


478 


NE ARG R 93 




ATOM 


479 


CZ ARG R 93 




ATOM 


480 


NH1 ARG R 93 


30 


ATOM 


481 


NH2 ARG R .93 




ATOM 


482 


H ARG R 93 




ATOM 


483 


HE ARG R 93 




ATOM 


484 1HH1 ARG R 93 




ATOM 


485 2HH1 ARG R 93 


35 


ATOM 


486 1HH2 ARG R 93 




ATOM 


487 2HH2 ARG R 93 



-13.307-45.598 12.397 1.00 0.00 
-12.719-46.613 13.229 1.00 0.00 
-12.269-44.936 11.490 1.00 0.00 
-14.031 -46.141 14.672 1.00 0.00 

-11.932-46.206 13.613 1.00 0.00 
-15.264-42.906 11.946 1.00 0.00 
-16.086-42.598 10.774 1.00 0.00 
-15.551 -43.214 9.474 1.00 0.00 
-15.041 -42.576 8.553 1.00 0.00 
-16.263-41.089 10.611 1.00 0.00 
-15.526-40.320 11.595 1.00 0.00 
-17.753-40.737 10.608 1.00 0.00 
-14.546-42.271 12.218 1.00 0.00 

-16.170-40.159 12.292 1.00 0.00 
-15.638-44.548 9.437 1.00 0.00 
-14.952-45.240 8.341 1.00 0.00 
-14.947.46.747 8.485 1.00 0.00 
-15.038-47.506 7.517 1.00 0.00 
-16.190-45.008 10.134 1.00 0.00 
-14.906-47.132 9.766 1.00 0.00 
-15.165-48.526 10.101 1.00 0.00 
-16.597-48.659 10.561 1.00 0.00 
-17.179-47.725 11.108 1.00 0.00 
-14.257-49.027 11.231 1.00 0.00 
-12.866-49.498 10.801 1.00 0.00 
-12.088-50.205 11.924 1.00 0.00 
-11.530-49.278 12.906 1.00 0.00 
-11.885-49.270 14.210 1.00 0.00 
-11.321 -48.407 15.046 1.00 0.00 
-12.806-50.108 14.666 1.00 0.00 
-14.866-46.429 10.469 1.00 0.00 
-10.826-48.644 12.583 1.00 0.00 
-11.678-48.273 15.969 1.00 0.00 
-10.552-47.825 14.774 1.00 0.00 
-13.112 -50.146 15.616 1.00 0.00 
-13.266-50.761 14.056 1.00 0.00 
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ATOM 


488 


N CYS R 94 


-17.121 -49.864 10.349 1.00 0.00 




ATOM 


489 


CA CYS R 94 


-18.327-50.307 11.053 1.00 0.00 




ATOM 


490 


C CYS R 94 


-18.187-51.790 11.280 1.00 0.00 




ATOM 


491 


O CYS R 94 


-18.340-52.598 10.361 1.00 0.00 


5 


ATOM 


492 


CB CYS R 94 


-19.611 -50.063 10.264 1.00 0.00 




ATOM 


493 


SG CYS R 94 


-20.314-48.399 10.429 1.00 0.00 




ATOM 


494 


H CYS R 94 


-16.640-50.508 9.754 1.00 0.00 




ATOM 


495 


N GLU R 95 


-17.798-52.106 12.524 1.00 0.00 




ATOM 


496 


CA GLU R 95 


-17.382-53.477 12.826 1.00 0.00 


10 


ATOM 


497 


C GLU R 95 


-18.334-54.564 12.388 1.00 0.00 




ATOM 


498 


O GLU R 95 


-19.545-54.380 12.270 1.00 0.00 




ATOM 


499 


CB GLU R 95 


-17.063-53.650 14.313 1.00 0.00 




ATOM 


500 


CG GLUR 95 


-15.587-53.436 14.658 1.00 0.00 




ATOM 


501 


CD GLUR 95 


-15.185-51.998 14.416 1.00 0.00 


15 


ATOM 


502 


OE1 GLU R 95 


-14.757-51.672 13.318 1.00 0.00 




ATOM 


503 


OE2GLUR 95 


-15.280-51.193 15.328 1.00 0.00 




ATOM 


504 


H GLU R 95 


-17.570-51.381 13.172 1.00 0.00 




ATOM 


505 


N ALA R 96 


-17.715-55.723 12.129 1.00 0.00 




ATOM 


506 


CA ALAR 96 


-18.517-56.877 11.727 1.00 0.00 


20 


ATOM 


507 


C ALA R 96 


-18.962-57.733 12.900 1.00 0.00 




ATOM 


50B 


O ALA R 96 


-20.158-57.994 13.010 1.00 0.00 




ATOM 


509 


CB ALA R 96 


-17.748-57.740 10.723 1.00 0.00 




ATOM 


510 


OXTALAR 96 


-18.116-58.127 13.704 1.00 0.00 




ATOM 


511 


H ALA R 96 


-16.728-55.754 12.290 1.00 0.00 


25 


END 









'Principal residues are bolded. 
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Appendix 4. The Fortran-77 source code of the VBMC algorithm.* 
SUBROUTINE MONTE(XMIN,N,S0) 

C* 

5 c* Variable Basis Monte Carlo simulated annealing: 
C* XMIN - initial configurational variables and 
C* final variables for subsequent local energy 

C* refinement. 

C* N - the number of configurational variables. 
10 C* SO - initial energy. 

C* FUN - potential energy function. 
C* 

IMPLICIT REAL*8(A-H.O-Z) 
DIMENSION XX(250),X(250),XMIN(1) 
15 DIMENSION Y(500,250),C(250,250),E(250),V(250.250) 

IYES=0 
ITEST=0 

READ(5.1)N1,N4,N5.NG1,ITE,IPR1.N33 
IF(N1.EQ.0)RETURN 
20 WRITE(6.3000)lTE.S0 
TT=ITE 

3000 FORMAT(/1 OX/INITIAL TEMPERATURE', 17, 

1 ' INITIALENERGY'.FI 5.6/) 
PR1=IPR1 

25 PR2=PR1*2.D0 

WRITE(6,210)PR1,PR1 
210 FORMATC RANDOM TORSIONAL INCREMENT -' 

2 ,F5.2,' TO +-.F5.2; GRAD') 
WRITE(6,302) 

30 302 FORMAT(1 0X,'*VARIABLE BASIS MC SIMULATED ANNEALING*'/) 

C" Boltzmann factor. 

T=-1./(0.001987*TT) 
C *** TEMPERATURE T=ITE *** 
1 FORMAT(15l5) 
35 N1=N°N1 
N3=N 
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K=N 

C* Cooling schedule. 
DT=(ITE-120)/(N5*N3) 
NG=NG1 
5 IF(NG.EQ.0)NG=1 

C* Initial set of random number generator. 
CALL SRAND(NG) 
WRITE(6,2)N1.N3,N4,N5,NG 
2 FORMAT(/5X,'MONTE-CARLO PARAMETERS:'/ 
10 *5X, , BLINDCUT\I10/ 
*5X/BYPASS\I10/ 

*5X,'FREQUENCY OF CONTROL OUTPUTJ10/ 
•5X/LENGTH OF ANNEALING REGION',110/ 
•5X/INITIAL SET OF THE RANDOM GENERATOR ',110/) 
15 IBEGIN=0 

MNCARL=0 

LQ=0 

17=0 

L1=0 

20 EMIN=1.D37 
S3=0. 
S4=0. 

C* Initial set of basis of independent variables. 
DO 88 1=1, N 
25 D0 88J=1,N 

V(l,J)=0.D0 
IF(LEQ.J)V(I,J)«1.D0 
88 CONTINUE 
DO 2005 1=1, N 
30 2005 X(I)=XMIN(I) 
264 S1=0. 
S2=0. 
L=0 

C° Method Monte Carlo. 
• 35 |YES=0 
KG=0 
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L2=0 
1=0 
26 1=1+1 

C* Random choice of a variable to move. 
5 H=RAND() 

IK=IDINT(H*N)+1 

RA=RAND() 

D=RA*PR2-PR1 

C* Variable Basis Markov chain: move along basis vectors. 
10 D0 513IN1.N 

513 XX(II)=X(II)+V(II,IK)*D 

CALL FUN(N.XX.SS) 
G* SS - potential energy of the configuration XX, 

S=SS-S0 
1 5 BI=S*T 

IF(Bl.GT.50.D0) GOTO 263 
C* CALCULATION OF TRANSITION PROBABILITY 

IF(BI.LT.-50.D0) GOTO 262 

B=DEXP(BI) 
20 W7=B/(B+1.D0) 

C* DOES TRANSITION TAKE PLACE ? 

Z=RAND() 

IF(W7-2)262,263,263 
C* ** NO ** 
25 262 CONTINUE 
GOTO 274 
C* " YES ! ** 
263 S0=SS 
IYES=IYES+1 
30 DO 1314 11=1, N 

1314 X(II)=XX(II) 
C* Blind cut off 
274 IF(IBEGIN.EQ.I) GOTO 266 
MNCARL=ivlNCARL+1 
35 IF(MNCARL-N1)26,1264,265 
1264 WRITE(6,1265)MNCARL,S0 

114 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9B06048A2_I_> 



WO 98/06048 



PCT/CA97/00539 



1265 FORMATCInitial point has been cut ofr,!10,1PD17.9) 
C* Cut off upon criteria. 
GOTO 264 
265 S1=S1+S0 
5 S2=S2+S0*S0 
L=L+1 

IF(L.NE.20*N) GOTO 26 
L=0 

SR1=S1/I 
10 DISP1=S2/I-SR1"2 

IF(DABS(SR1-S0)-DSQRT(DISP1))270,270,271 
271 ITEST=0 
GOTO 272 
270 ITEST=ITEST+1 
15 272 YES=(1.D2*IYES)/I 

WRITE(6.268)I,S0,SR1 .DISP1 .ITEST, YES 

268 FORMAT('Cut off ,I8,3D14.7,I3,2PF7.1) 
IF(I.GE.50000)GOTO 267 
IF(ITEST-3) 26,267,267 

20 267 IBEGIN=1 

WRITE(6,1265)MNCARL.S0 
WRITE(6,269) 

269 FORMATf/MSX.'Equilibrium has been reached 7) 
GOTO 264 

25 C* N3 - parameter of bypassing. 
C* DT - cooling schedule. 
266 L=L+1 

IF(TT.LE.120.D0)GOTO 3333 
C* Slow cooling. 
30 TT=TT-DT 

C* Boltzmann factor. 
T=-1.D07(1.987D-3*TT) 
IF(L.LT.N3)GOT0 26 
L=0 

_ 35 KG=KG+1 

C* Accumulating N33 sequential points of the Markov chain. 
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DO 1992 J=1.N 
1992 Y(KG,J)=X(J) 
L1-L1+1 
L2=L2+1 

5 IF(S0.GE.EMIN)GOTO 1957 

EMIN=S0 
DO 1961 IJ=1,N 
1961 XMIN(IJ)=X(IJ) 
LQ=LQ+1 

1 0 WRITECe.lOOJLQ.I.EMIN.TT.fXMINfJJ.Jsl ,N) 

100 FORMATS LQ\2I7,D17.9,F7.1/(1X,10F12.3)) 
447 FORMAT(/' CR\2I7,D17.9,F7.1/(1X,10F12.3)) 
1957 IF(KG.LT.N33)GOTO 45 

C* Computing covariant matrix of the distribution of 
15 C* the N33 accumulated points in the configurational 
C* space. 
KG=0 

DO 71 ll=1,K 

YYY=0.0 
20 D0 82J=1,N33 

82 YYY=YYY+Y(J,II) 

YYY=YYY/N 

DO 54 J=1,N33 
54 Y(J,ll)=Y(J,lt)-YYY 
25 71 CONTINUE 

DO 710 11=1 ,K 

DO710J=1,K 

S=0.0 

DO 711 LL=1,N33 
30 711 S=S+Y(LL.II)*Y(LL,J) 

710 C(II,J)=S 
C* Covariant matrix is ready. 

C° Roration of basis V: 
CALL SDIA62(250,K,C,E,V) 
35 45 IF(L2.LT.N4) GOTO 26 
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L2=0 

YES=(1.D2*IYES)/I 

WRITE(6,72)I,L1.EMIN,S0,TT,YES 
72 FORMATC CO\2I10,2D15.6,F7.1,2PF7.1) 

IF(L1.LT.N5) GOTO 26 
3333 WR!TE(6,1111) 

1111 FORMAT(//10X,'END OF MONTE CARLO STAGE') 
• RETURN 
END 

SUBROUTINE SDIAG2(M,N,A,D,X) 

C" 

C* Diagonalization of symmetric matrix A (NxN). 
G* Results: D - eigenvalues; X - eigenvectors. 
C* 

IMPLICIT REAL*8(A-H.O-Z) 

DIMENSION A(M,M),D(M),X(M,M) 

DIMENSION E(250) 

EPS=1.0D-14 

TOL=1.0D-30 

IF(N.EQ.1)GOTO 400 

DO 10 1=1. N 

DO 10 J=1,l 
10X(I,J)=A(I,J) 

DO 150 NI=2,N 

ll=N+2-NI 

DO 150 l=ll,ll 

L=l-2 

H= 0D0 

G=X(I,I-1) 

IF(L)140,140,20 
20 DO 30 K=1,L 
30 H=H+X(I,K)**2 

S=H+G*G 

IF(S.GE.TOL)GOTO 50 

H=.0D0 

GOTO 140 
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50 IF(H)140. 140,60 
60 L=L+1 
F=G 

G=DSQRT(S) 
5 IF(F)75.75,70 
70 G=-G 
75 H=S-F«G 
X(I.1-1)=F-G 
F=. 0D0 
10 DO110J=1,L 
X(J,I)=X(I.J)/H 
S=OD0 
DO 80 K=1,J 
80 S=S+X(J,K)'X(I.K) 
15 J1=J+1 

IF(J1.GT.L)GOTO 100 
DO 90 K=J1,L 
90 S=S+X(K,J)*X(i,K) 
100 E(J)=S/H 
20 110 F=F+S*X(J,I) 

F=F/(H+H) 
DO 120 J=1,L 
120 E(J)=E(J)-F*X(I,J) 
DO 130 J=1,L 
25 F=X(I,J) 
S=E(J) 

DO 130 K=1,J 
130 X(J,K)=X(J,K)-F*E(K)-X(I,K)-S 
140 D(I)=H 
30 150E(I-1)=G 

160 D(1)=X(1,1) 

X(1,1)=1.D0 

DO 220 l=2.N 

L=l-1 

35 IF(D(I))200,200,170 
170 DO 190 J=1,L 
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S=. 0D0 
DO180K=1,L 
180 S=S+X(I,K)*X(K,J) 
DO 190 K=1,L 
5 190X(K,J)=X(K,J)-S*X(K,I) 
200 D(I)=X(I,I) 
X(I,I)=1.D0 
210 DO 220 J=1,L 
X(l,J)=.0D0 
10 220 X(J,l)=.0D0 

B=.0D0 
F=.0D0 
E(N)=.0D0 
' DO340L=1.N 
1 5 H=EPS*(DABS(D(L))+DABS(E(L))) 
IF(H.LE.B)GOTO 235 
B=H 

235 DO 240 J=L,N 

IF(DABS(E(J)).LE.B)GOTO 250 
20 240 CONTINUE 

250 IF(J.EQ.L)GOTO 340 
260 P=(D(L+1)-D(L))\5D0/E(L) 

R=DSQRT(P*P+1.D0) 

IF(P)270,280,280 
25 270 P=P-R 

GOTO 290 
280 P=P+R 
290 H=D(L)-E(L)/P 

DO 300 l=L,N 
30 300 D(I)=D(I)-H 

F=F+H 

P=D(J) 

C=1.D0 

S=0.D0 

, 35 J 1 = J- 1 

DO330 NI=L,J1 

119 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 980604BA2J_> 



WO 98/06048 



PCT/CA97/00539 



ll=L+J1-NI 
DO 330 l=ll,ll 
G=C*E(I) 
H=C*P 

5 IF(DABS(P).LT.DABS(E(l)))GOTO 310 

C=E(I)/P 

R=DSQRT(C*C+1.D0) 

E(I+1)=S*P*R 

S=C/R 
10 C=1.D0/R 

GOTO 320 
310 C=P/E(I) 

R=DSQRT(C*C+1.D0) 

E(I+1)=S*E(I)*R 
15 S=1.D0/R 

C=C/R 
320 P=C*D(I)-S*G 

D(I+1)=H+S*(C*G+S*D(I)) 

DO 330 K=1,N 
20 H=X(K,I+1) 

X(K,I+1)=X(K,I)*S+H*C 
330 X(K,I)=X(K,I)*C-H*S 

E(L)=S*P 

D(L)=C*P 

25 IF(DABS(E(L)).GT.B)GOTO 260 

340 D(L)=D(L)+F 

NI=N-1 
350 DO 380 1=1, Nl 

K=l 

30 P=D(I) 
J1=l+1 

DO 360 J=J1,N 
IF(D(J).GE.P)GOTO 360 
K=J 

35 P=D(J) 

360 CONTINUE 
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IF(K.EQ.l)GOTO 380 

D(K)=D(I) 

D(I)=P 

DO 370 J=1,N 
5 P=X(J,I) 

X(J,I)=X(J,K) 
370 X(J,K)=P 
380 CONTINUE 

RETURN 
10 400 D(1)=A(1,1) 

X(1,1)=1.D0 

RETURN 

END 



•The essential places of the VBMC algorithm (different from conventional Monte Carlo 
technique) are bolded. 
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WHAT IS CLAIMED IS: 

1 A variable basis Monte-Carlo (VBMC) simulated annealing method for identifying an 
optimal structure in a molecular system, comprising. 
5 (a) providing a Markov chain with an initial basis set of N configurational variables 

which define a structure for said molecular system; 

(b) translating the N configurational variables simultaneously along a basis vector 
to produce a new structure for the molecular system, wherein magnitude of translation is 
randomly chosen within a preselected range; 
1 0 ( C ) calculating potential energy of the new structure of the molecular system; 

(d) deciding whether to accept or reject the new structure using an effective 
temperature dependent transition function, and if the new structure is accepted, replacing the 
current structure with the accepted structure; 

(e) repeating steps (b) to (d), each repetition having a new basis vector, for a 
1 5 preselected first number of repetitions at a preselected upper temperature; then 

(f) decreasing temperature according to a preselected cooling schedule; 

(g) repeating steps (b) to (d) and (f), each repetition having a new basis vector, for 
a preselected second number of repetitions; then 

(h) storing current structure of the molecular system; 

20 (|) repeating steps (g) and (h) until a preselected number of stored structures 

have been accumulated; then 

(j) rotating the basis set of configurational variables according to an effective 
distribution of the accumulated structures so that basis vectors are directed along low-energy 
valleys of a potential energy hypersurface thereby accelerating conformational motions and 
25 structural transitions of the molecular system, and erasing accumulated structures from steps 
(h) and (I); 

(k) repeating steps (I) and 0) until, after an effective third number of repetitions, a 
preselected lower temperature limit is reached, thereby producing an unrefined global 

minimum structure; and 
30 (|) refining the global energy minimum structure so that at least one local 

minimum on the potential energy hypersurface is identified. 

2. The method of claim 1 , wherein the basis vector is chosen randomly within said 
preselected range. 



35 



The method of claim 1 , wherein the basis vector is chosen deterministically within said 
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preselected range. 

4. The method of claim 1 wherein the preselected second number of repetitions is N or a 
multiple thereof. 

5. The method of claim 1 wherein the preselected second number of repetitions is at 
least about 2N. 

6 The method of claim 1 wherein the refining step is non-linear minimization. 

7 The method of claim 6, wherein the refining step is gradient descent minimization. 

8. The method of claim 6, wherein the refining step is conjugate gradient minimization. 

9. The method of claim 6, wherein the refining step is a quasi-Newton minimization 
method. 

10. The method of claim 9, wherein the quasi-Newton method is Newton -Raphson 
method. 

11. The method of claim 6, wherein the refining step is Marquardt method. 

12. The method of claim 6, wherein the refining step is a variable metric method. 

13. The method of claim 6, wherein the refining step is Powell method. 

14. The method of claim 1 wherein said molecular system is at least one peptide domain 
comprising amino acid residues, each amino acid residue comprising a main chain having a- 
carbon atoms, carbonyl carbon atoms and nitrogen atoms, said basis set of conf.gurational 
variables comprising torsional angles between said a-carbon atoms and neighbouring 
carbonyl carbon atoms, and torsional angles (<t>.) between said a-carbon atoms and 
neighbouring nitrogen atoms. 

15. The method of claim 14 wherein torsional angles describing planar portions of said 
peptide domains are held constant. 
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The method of claim 14 wherein said effective temperature dependent transition 
function for screening a new state x*, derived from a prev.ous state x, is given by: 

W(x, - x,.,) = 1/(1 + exp(AE, /kT)] 
where AE, is the difference between conformational energ.es of the states x,. , and x.. k is the 
Boltzmann constant and T is absolute temperature. 



17 The method of claim 14 wherein said effective temperature dependent transition 
function for screening a new state x„ derived from a previous state x, is given by: 

W(x,-x, l )«1. ifE(x, 1 )_<E(W(x,) 
= exp(-AE,/kT), if E(x lM ) * E(W(x.) 
where AE, is the difference between conformational energies of the states x„, and x,. k is the 
Rnit^mann constant and T is absolute temperature. 



method of claims 14, 15. 16 or 17 wherein said preselected upper temperature is 



18. The 
about 1000°K. 

19. The method of claim 



18 wherein said preselected lower temperature is about 120°K. 



20. A method of identifying structures of a peptide domain of a neurotropic to bind with a 

binding site of a receptor, comprising; 

a) providing a plurality of neurotrophin analogues wherein at least one 
neurotrophin analogue binds said receptor; 

b) identifying an optima, or near-optima, structure for respective peptide doma.ns 
of the neurotrophin analogues that bind said receptor by app.y,ng a prese.ected number of the 
variable bas.s Monte-Carlo (VBMC) annealing simulations according to claim 1 to each of the 
respective peptide domains to identify minimum or near- 
peptide domains; and 

c) comparing said minimum or 
common structural features shared by neurotrophin analogues exhibiting receptor mediated 

activity. 

21 The method according to Cairn 20 wherein said common structural features are 
absent from neurotrophin analogues that do not exhibit receptor-mediated act.vrty. 



r-minimum energy structures for said 
near-minimum energy structures to identify 



35 22. A method of identifying structures 



of a domain of a ligand to bind with a binding site of 
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a receptor, comprising; 

a) providing a plurality of ligand analogues wherein at least one ligand analogue 

binds said receptor, 

b) identifying an optimal or near-optimal structure for respective domains of the 
5 ligand analogues that bind said receptor by applying a preselected number of the variable 

basis Monte-Carlo (VBMC) annealing simulations according to claim 1 to each of the 
respective domains to identify minimum or near-minimum energy structures for said domains, 
and 

c) comparing said minimum or near-minimum energy structures to identify 
1 0 common structural features shared by ligands exhibiting receptor mediated activity. 

23. The method according to claim 22 wherein said common structural features are 
absent from ligand analogues that do not exhibit receptor-mediated activity. 

1 5 24. The method according to claim 20 wherein said neurotrophin is NGF, and said 
receptor is TrkA. 

25. The method according to claim 20 wherein said neurotrophin is brain-derived 
neurotrophic factor (BDNF), neurotrophin-3 (NT-3), or neurotrophin-4 (NT-4), and said 

20 receptor is TrkB. 

26. The method according to claim 20 wherein said neurotrophin is neurotrophin-3 (NT-3) 
and said receptor is TrkC. 

25 27. The method according to claim 20 wherein said receptor is the common neurotrophin 
receptor p75 NTR . 

28. A ligand for binding with TrkA, wherein TrkA comprises a leucine rich motif (LRM) 
comprising amino acid residues 93 to 117. wherein coordinates of the residues of said LRM 

30 are given in Appendix 1 . said LRM having five binding areas, area A comprising amino acid 
residue Phe 10SW capable of hydrophobic interaction, area B comprising amino acid residues 
Phe 111tA , Phe 113 *, and Thr 11 ™ capable of hydrophobic interaction, area C comprising amino 
acid residues Asp 10aA and His 11 ™ capable of ionic interaction, area D comprising amino acid 
residue Lys 10mA capable of ionic interaction, and area E comprising Asn 95 * to lie 98 * capable of 

35 multiple parallel fJ-strand type hydrogen bonding, the ligand comprising; 

at least three moieties each compnsing effective atomic elements having spatial 
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occupancy, relative atomic position, bond type and charge to define a three dimensional 
conformation of said ligand so that a first moiety can bind with a first binding area, a second 
moiety can bind with a second binding area, and a third moiety can bind with a third binding 



area. 



29 The ligand according to claim 28 wherein the first moiety can hydrophobically bond 
with binding area A, the second moiety can ionically bind with binding area C, and the third 
moiety can ionically bind with binding area D. 

1 0 30. The ligand according to claim 29 comprising a fourth moiety effectively spaced with 
respect to the other moieties so it can hydrophobically bond with binding area B. 

31 The ligand according to claim 29 comprising a fourth moiety effectively spaced with 
respect to the other moieties so it can hydrogen bond with binding area E. 

15 

32 The ligand according to claim 29 wherein the first moiety is capable of hydrophobic 
bonding to Phe 105 * in binding area A, the second moiety is positively charged and present in 
an effective position in said ligand to ionically interact with Asp' 09 " and His 1 «* in binding area 
C, and the third moiety is negatively charged and present in an effective position in said 

20 ligand to ionically interact with Lys 10aA in binding area D. 

33 The ligand according to claim 32 wherein the moiety hydrophobically bonded to 
Phe ,06tA in binding area A is Trp, wherein the positively charged moiety which ionically 
interacts with Asp 109 * and His 11JlA in binding area C is Arg and Lys. and wherein the negatively 

25 charged moiety which ionically interacts with Lys 100 * in binding area D is Asp. 

34. The ligand according to claim 32 wherein said first, second and third moieties are 
amino acids or isosteres thereof. 

30 35- The ligand according to claim 30 wherein said fourth moiety is an amino acid or 
isostere thereof. 

36. The ligand according to claim 31 wherein said fourth moiety is an amino acid or 

isostere thereof. 



37. A method of designing a ligand to bind with a LRWI of TrkA, comprising: 



134 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 9806048A2J_> 



WO 98/06048 



PCT/CA97/00539 



providing a template of said LRM of TrkA. said LRM comprising amino acid residue 
sequence 93 to 117 inclusive, said amino acid residues having spatial Cartesian coordinates 
given in Appendix 1 wherein said binding area A comprises Phe 105lA capable of hydrophobic 
bonding, binding area B comprises Phe 111tA , Phe 113tA , Thr 114tA capable of hydrophobic bonding, 
5 binding area C comprises Asp 109tA and His t1?lA capable of ionic interaction, binding area D 

comprises Lys 100tA capable of ionic interaction, and binding area E comprises Asn 95 * to lie 98 * 
capable of hydrogen bonding; and 

computationally evolving a ligand using an effective algorithm so that said evolved 
ligand comprises at least three effective moieties located relative to each other in the ligand 
10 so that a first moiety can bind with a first of said five binding areas, a second moiety can bind 
with a second of said five binding areas, and a third moiety can bind with a third of said 
binding areas. 

38. The method according to claim 37 including the step of generating said ligand under a 
1 5 constraint that the evolved ligand comprises said first effective moiety spatially located to 
hydrophobically bind with binding area A, said second moiety spatially located to ionically 
interact with binding area C, and said third moiety spatially located to ionically interact with • 
binding area D. 

20 39. The method according to claim 37 including the step of generating said ligand under a 
constraint that the evolved ligand comprises an effective moiety spatially located to 
hydrophobically interact with binding area B. 

40. The method according to claim 37 including the step of generating said ligand under a 
25 constraint that the evolved ligand comprises an effective moiety spatially located to hydrogen 

bond with binding area E. 

41 . A ligand for binding with TrkB t wherein TrkB comprises a leucine-rich motif (LRM) 
comprising amino acid residues 93 to 117, the coordinates of the residues of said LRM given 

30 in Appendix 2, said LRM having five binding areas, binding area F comprising amino acid 
residue Asp 100IB capable of ionic interaction, binding area G comprising amino acid residue 
Lys t09tB capable of ionic interaction, binding area H comprising amino acid residue Lys 113tB 
capable of ionic interaction, binding area I comprising amino acid residue Arg 9418 capable of 
ionic interaction, and binding area K comprising amino acid residue Lys 104tB capable of 

35 hydrogen bonding or ioniic interaction, the ligand comprising: 

at least three moieties each comprising effective atomic elements having spatia! 
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occupancy, relative atomic position, bond type and charge to define a three dimensional 
conformation of said ligand so that a first moiety can bind with a first binding area, a second 
moiety can bind with a second binding area, and a third moiety can bind with a third binding 
area. 

42. The ligand according to claim 41 wherein one of said at least three moieties can 
ionically bind with binding area F, another of said at least three moieties can ionically bind 
with binding area G, and another of said at least three moieties can form a hydrogen bond or 
ionically bind with binding area K. 

43. The ligand according to claim 42 wherein said ligand includes a fourth moiety 
effectively spaced relative to said at least three moieties so that it can ionically bind with 
binding area H. 

44. The ligand according to claim 42 wherein said ligand includes a fourth moiety 
effectively spaced relative to said at least three moieties so that it can ionically bind with 
binding area I. 

45. The ligand according to claim 41 wherein the first moiety is positively charged and 
present in an effective position in said ligand to ionically bind to Asp 100te in binding area F, the 
second moiety is negatively charged and present in an effective position in said ligand to 
ionically interact with Lys 109tB in binding area G, and the third moiety is neutral or negatively 
charged and capable of hydrogen bonding or ionically bonding to Lys 104lB in binding area K. 

46. The ligand according to claim 45 wherein a fourth moiety is negatively charged and 
present in an effective position in said ligand to ionically bind to Lys 113tB in binding area H, and 
wherein a fifth moiety is negatively charged and present in an effective position in said ligand 
to ionically bind to Arg*" B in binding area I. 

47. The ligand according to claim 46 wherein the positively charged moiety ionically 
bonded to Asp 100tB in binding area F is Arg, wherein the charged moiety which ionically 
interacts with Lys 109,B in binding area G is Glu and Lys, and wherein the negatively charged 
moiety ionically bonded with Lys 113tB in binding area H is Asp. 

48. The ligand according to claim 47 wherein the negatively charged amino acid residue 
ionicaiiy bonded to Arg 9419 in binding area I is Glu. 
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49. The ligand according to claim 45 wherein the neutral or negatively charged moiety 
capable of hydrogen bonding or ionically bonding to Lys 1M,B in binding area K is Thr. 

50. The ligand according to claim 45 wherein said first, second and third moieties are 
amino acids or isosteres thereof. 

51. The ligand according to claim 46 wherein said fourth and fifth moieties are amino 
acids or isosteres thereof. 

52. A method of designing a ligand to bind with TrkB, wherein TrkB comprises a leucine- 
rich motif (LRM) comprising amino acid residues 93 to 117, the coordinates of the residues of 
said LRM given in Appendix 2, said LRM having five binding areas, binding area F comprising 
amino acid residue Asp 100lB capable of ionic interaction, binding area G comprising amino acid 
residue Lys 109tB capable of ionic interaction, binding area H comprising amino acid residue 
Lys ii3tB ca p a 5i e D f j 0n i C interaction, binding area I comprising amino acid residue Arg 9416 
capable of ionic interaction, and binding area K comprising amino acid residue Lys 104!B 
capable of hydrogen bonding or ionic interaction, the method comprising: 

computationally evolving a ligand using an effective algorithm so that said evolved 
ligand comprises at least three effective moieties located relative to each other in the ligand 
so that a first moiety can bind with a first of said five binding areas, a second moiety can bind 
with a second of said five binding areas, and a third moiety can bind with a third of said 
binding areas. 

53. A ligand for binding with the common neurotrophin receptor p75 NTR , wherein p75 NTR 
comprises a binding site including amino acid residues Cys 39p to Cys 94 " inclusive, said 
residues of said binding site having spatial coordinates given in Appendix 3 t said binding site 
having three binding areas including binding loop 2A comprising region Cys 39 * to Cys 56 * 
capable of attractive electrostatic interaction, binding loop 2B comprising region Cys 5ap to 
Cys 79p capable of attractive electrostatic interaction, and binding loop 3A comprising region 
Cys 79p to Cys 94 * capable of attractive electrostatic interaction, the ligand comprising; 

at least two moieties each comprising effective atomic elements having spatial 
occupancy, relative atomic position, bond type and charge to define a three dimensional 
conformation of said ligand so that a first moiety can bind with a first binding area, and a 
second moiety can bind with a second binding area. 

54. The iigand according to claim 53 wherein said ligand is an agonist, and wherein one 
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of said at least two moieties binds to loop 2B. 

55. The ligand according to claim 53 further comprising a third moiety wherein said first 
moiety can bind to loop 2A, said second moiety can bind to loop 2B and said third moiety can 

5 bind to loop 3A. 

56. The ligand according to claim 53 wherein said first moiety binds to any two of 
Asp 47p , Lys 56 * and Glu 53p . 

1 o 57. The ligand according to claim 53 wherein said binding loop 2A includes amino acid 
residues Asp 47p and Lys 56 * and said first moiety ionically binds to said residues Asp 47p and 
Lys 5 * 3 

58. The ligand according to claim 53 wherein said binding loop 2B includes amino acid 

1 5 residues Asp 75p and Asp 76p and said second moiety ionically binds to said residues Asp 75p and 
Asp 76p 

59. The ligand according to claim 55 wherein said binding loop 3A includes amino acid 
residues Asp 880 and Glu 89p and said third moiety ionically binds to said residues Asp 880 and 

20 Glu 8 * 

60. The ligand according to claim 55 wherein said third moiety binds to any two of 
Asp 8815 , Glu** and Arg** 

25 61 A method of designing a ligand for binding with the common neurotrophin receptor 
p75 NTR , wherein p75 NTR comprises a binding site including amino acid residues Cys 39p to 
Cys wp inclusive, said residues of said binding site having spatial coordinates given in 
Appendix 3 t said binding site having three binding areas including binding loop 2A comprising 
region Cys 39p to Cys Mp capable of attractive electrostatic interaction, binding loop 2B 

30 comprising region Cys 5 ^ to Cys 79p capable of attractive electrostatic interaction, and binding 
loop 3A comprising region Cys 79p to Cys 94p capable of attractive electrostatic interaction, the 
method comprising; 

computationally evolving a ligand using an effective algorithm so that said evolved 
ligand comprises at least two effective moieties located relative to each other in the ligand so 
35 that a first moiety can bind to a first of said three loops and a second moiety can bind to a 
second loop of said three loops. 
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62. A method of identifying a ligand to bind with a leucine rich motif (LRM) of TrkA, 
comprising: 

providing a three dimensional conformation for said LRM, said LRM comprising amino 
acid residue sequence 93 to 117 inclusive, said amino acid residues having spatial Cartesian 
coordinates given in Appendix 1. said LRM having five binding areas, area A compnsing 
amino acid residue Phe 105lA capable of hydrophobic interaction, area B comprising amino acid 
residues Phe 111tA , Phe 113IA , and Thr UM capable of hydrophobic interaction, area C comprising 
amino acid residues Asp 109 * and His ,12tA capable of ionic interaction, area D comprising amino 
acid residue Lys 100IA capable of ionic interaction, and area E comprising Asn 95 * to lle MtA 
capable of multiple parallel p-strand type hydrogen bonding; 

providing a data base containing molecules coded for spatial occupancy, relative 
atomic position, bond type and/or charge; and 

screening said data base to select those molecules comprising moieties having a 
three dimensional conformation and effective charge that can bind with at least three of said 
five binding areas. 

63. A method of identifying a ligand to bind with a leucine rich motif (LRM) of TrkB. 
comprising: 

providing a three dimensional conformation for said LRM. said LRM comprising amino 
acid residues 93 to 1 17. the coordinates of the residues of said LRM given in Appendix 2. 
said LRM having five binding areas, binding area F comprising amino acid residue Asp 100,B 
capable of ionic interaction, binding area G comprising amino acid residue Lys ,091B capable of 
ionic interaction, binding area H comprising amino acid residue Lys 1,3tB capable of ionic 
interaction, binding area I comprising amino acid residue Arg M,B capable of ionic interaction, 
and binding area K comprising amino acid residue Lys ,M,B capable of hydrogen bonding or 
ionic interaction; 

providing a data base containing molecules coded for spatial occupancy, relative 
atomic position, bond type and/or charge; and 

screening said data base to select those molecules comprising moieties having a 
three dimensional conformation and effective charge that can bind with at least three of said 
five binding areas. 



NTR 



64. A method of identifying a ligand to bind with common neurotrophin receptor p75 
comprising: 

providing a three dimensional conformation for the common neurotrophin receptor 
p75 NTR , wherein p75 NTR comprises a binding site including amino acid residues Cys 39p to 
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Cys MD inclusive, said residues of said binding site having spatial coordinates given in 
Appendix 3, said binding site having three binding areas including binding loop 2A comprising 
region Cys 39 " to Cys 58 " capable of attractive electrostatic interaction, binding loop 2B 
comprising region Cys 58 " to Cys 79p capable of attractive electrostatic interaction, and binding 
loop 3A comprising region Cys 79p to Cys 94B capable of attractive electrostatic interaction; 

providing a data base containing molecules coded for spatial occupancy, relative 
atomic position, bond type and/or charge; and 

screening said data base to select those molecules comprising moieties that can bind 
with at least two of said three binding loops. 
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